blog: add Test shell scripts using Python (#44)

Oreoxmt · Aug 23, 2023 · e92c709 · e92c709
1 parent 5eb425b
commit e92c709
Show file tree

Hide file tree

Showing 5 changed files with 715 additions and 1 deletion.
diff --git a/.gitignore b/.gitignore
@@ -0,0 +1,3 @@
+.idea
+.vscode
+.zed
diff --git a/website/blog/2023-08-23-python-test-shell-scripts.md b/website/blog/2023-08-23-python-test-shell-scripts.md
@@ -0,0 +1,356 @@
+---
+slug: 2023-08-23-python-test-shell-scripts
+title: "如何用 Python 测试 Shell 脚本，顺便写一个测试框架"
+authors: [Oreo]
+tags: [Python, Test, Shell]
+---
+
+# 如何用 Python 测试 Shell 脚本，顺便写一个测试框架
+
+本文以 [pingcap-docsite-preview](https://github.com/Oreoxmt/pingcap-docsite-preview) 项目中的测试框架为例，介绍如何使用 Python 测试 Shell 脚本。
+
+**pingcap-docsite-preview** 是一个用于预览文档内容的项目。它使用一些 Shell 脚本更新文档网站的内容。为了确保 Shell 脚本的代码质量和功能正确性，该项目使用 Python 构建了一个测试 Shell 脚本的框架。
+
+下面以测试 `sync_scaffold.sh` 为例，该脚本用于更新 `markdown-pages` 目录以及 `docs.json` 文件内容。
+
+<!--truncate-->
+
+## 问题拆解
+
+在设计测试框架之前，首先按照如下思路拆解问题。
+
+1. 测试是什么？
+
+  测试是为了验证程序是否符合需求，常见的验证方法是判断程序的实际运行结果是否与预期结果一致。所以，核心在于**比较**。
+
+2. 如何比较？
+
+  比较的具体技术实现取决于要比较的内容格式。如果程序返回的是 status code，可直接判断是否相等。在 `sync_scaffold.sh` 脚本中，返回的是文件。此时可以比较文件夹结构或文件内容。项目中使用 **[`diff`](https://linux.die.net/man/1/diff)** 命令比较文件内容。
+
+3. 如何获取要比较的内容？
+
+   - 预期的输出结果：人工编写
+   - 实际的输出结果：**程序运行**后产生
+
+4. 在哪里运行程序？
+
+  在开发过程中，我们可以在本地运行 Shell 脚本。但是，在测试中，为了保证测试的可重复性，我们需要准备一个**测试环境**用于运行 Shell 脚本。该测试环境需要包含 Shell 脚本和输入数据。
+
+  在测试 `sync_scaffold.sh` 脚本时，需要准备以下内容：
+
+   - `sync_scaffold.sh` 脚本并且有执行权限
+   - `sync_scaffold.sh` 的输入，即 `markdown-pages` 目录和 `docs.json` 文件
+
+   :::note
+
+   由于 `sync_scaffold.sh` 中的 `rsync` 命令可以自动创建 `markdown-pages` 目录，同时 `cp` 命令可以自动创建 `docs.json` 文件，所以在测试中可以使用空的输入数据。
+
+   :::
+
+5. 整体测试的工作流是什么？
+
+   1. 准备测试环境：
+      - 确保测试环境有且仅有测试所需的内容：`cp -r sync_scaffold.sh data/`
+      - 确保测试环境中的脚本有执行权限：`chmod +x sync_scaffold.sh`
+   2. 执行程序：`./sync_scaffold.sh`
+   3. 比较程序运行的实际结果 (`data/`) 和预期结果 (`actual/`)：`diff -r data actual`
+
+6. **如何在 Python 中实现测试？**
+
+   - 使用 [`shutil.rmtree()`](https://docs.python.org/3/library/shutil.html#shutil.rmtree) 清理测试环境
+
+    ```python
+    import shutil
+    import os
+
+    if os.path.exists(test_output):
+      shutil.rmtree(test_output)
+    os.makedirs(test_output, exist_ok=True)
+    ```
+
+   - 使用 [`shutil.copy()`](https://docs.python.org/3/library/shutil.html#shutil.copy) 复制测试脚本
+
+    ```python
+    import shutil
+    shutil.copy(script, test_feature_path)
+    ```
+
+   - 使用 [`os.chmod()`](https://docs.python.org/3/library/os.html#os.chmod) 确保脚本有执行权限
+
+    ```python
+    import os
+    os.chmod(script, 0o755)
+    ```
+
+   - 使用 [`subprocess`](https://docs.python.org/3/library/subprocess.html) 执行 Shell 脚本
+
+    ```python
+    import subprocess
+    process = subprocess.Popen(test_feature_path)
+    code = process.wait()
+    ```
+
+## 测试框架的代码实现
+
+### 定义成员
+
+`DocSitePreviewTest` 类的成员变量如下：
+
+- `test_dir`：测试环境所在的目录
+- `feature_dir`：待测试脚本所在的目录
+- `script_name`：待测试脚本的名称
+- `test_output`：测试环境的输出目录 (`test_dir/actual`)，用于存放实际输出
+- `test_feature_path`：测试环境中待测试脚本的路径 (`test_output/script_name`)
+
+```python title=test/test_util.py
+class DocSitePreviewTest:
+
+  def __init__(self, test_dir: str, feature_dir: str, script_name: str):
+    self.feature_dir = feature_dir
+    self.script_name = script_name
+    self.test_dir = test_dir
+    self.test_output = os.path.join(self.test_dir, "actual")
+    self.test_feature_path = os.path.join(self.test_output, self.script_name)
+    self._setup_test_env()
+```
+
+### 准备测试环境
+
+在初始化 `DocSitePreviewTest` 类时，会调用 `_setup_test_env()` 方法，该方法会执行以下操作：
+
+1. `DocSitePreviewTest._clean_up()` 清理测试环境，即确保 `test_output` 目录的内容为空
+2. `DocSitePreviewTest._copy_and_setup_script()` 复制待测试脚本到测试环境中
+3. `DocSitePreviewTest._make_script_executable()` 确保待测试脚本有执行权限
+
+```python title=test/test_util.py
+class DocSitePreviewTest:
+
+  def _setup_test_env(self):
+    """
+    Generate the test environment for execution.
+    1. Clean up the test environment.
+    2. Copy the target script to the test environment.
+    """
+    self._clean_up()
+    self._copy_and_setup_script()
+
+  def _clean_up(self):
+    """
+    Clean up the test environment.
+    """
+    if os.path.exists(self.test_output):
+        shutil.rmtree(self.test_output)
+    os.makedirs(self.test_output, exist_ok=True)
+
+  def _copy_and_setup_script(self):
+    """
+    Copy the script to the test environment.
+    """
+    shutil.copy(os.path.join(self.feature_dir, self.script_name), self.test_feature_path)
+    self._make_script_executable(self.test_feature_path)
+
+  @staticmethod
+  def _make_script_executable(script: str):
+    """
+    Make the script executable (chmod +x).
+    """
+    os.chmod(script, 0o755)
+```
+
+### 执行待测试脚本
+
+`DocSitePreviewTest.execute()` 方法用于执行待测试脚本，其中：
+
+- `args`：待测试脚本的参数
+- `env`：待测试脚本的环境变量
+
+```python title=test/test_util.py
+class DocSitePreviewTest:
+
+  def execute(self, args: str = "", env: dict | None = None):
+    """
+    Execute the feature command.
+    """
+    command_str = self.test_feature_path + " " + args
+    command_list = shlex.split(command_str)
+    self._execute_command(command_list, self.test_output, env)
+
+  @staticmethod
+  def _execute_command(command, cwd, env=None):
+    """
+    Execute a command and check its exit code.
+    Raise an exception if the command does not return 0.
+    """
+    process = subprocess.Popen(command, cwd=cwd, env=env)
+    code = process.wait()
+    if code != 0:
+        raise Exception("Error: command returned code {}".format(code))
+```
+
+使用示例：
+
+```python
+from test_util import DocSitePreviewTest
+
+script_name: str = "sync_scaffold.sh"
+script_args: str = "main"
+
+script_env = os.environ.copy()
+script_env["TEST"] = str(1)
+
+test = DocSitePreviewTest(test_dir, feature_dir, script_name)
+# highlight-next-line
+test.execute(args=script_args, env=script_env)
+```
+
+### 验证输出
+
+`DocSitePreviewTest.verify()` 方法用于验证待测试脚本的输出，其中 `command` 为 `diff` 命令，例如 `diff -r data actual`。
+
+```python title=test/test_util.py
+class DocSitePreviewTest:
+
+  @staticmethod
+  def _execute_command(command, cwd, env=None):
+    """
+    Execute a command and check its exit code.
+    Raise an exception if the command does not return 0.
+    """
+    process = subprocess.Popen(command, cwd=cwd, env=env)
+    code = process.wait()
+    if code != 0:
+        raise Exception("Error: command returned code {}".format(code))
+
+  def verify(self, command: str = "diff -r data actual"):
+    """
+    Use diff command to compare the expected output (data) and the actual output.
+    """
+    args = shlex.split(command)
+    self._execute_command(args, self.test_dir)
+    print("Test {} passed successfully".format(self.script_name))
+```
+
+使用示例：
+
+```python
+from test_util import DocSitePreviewTest
+
+# highlight-next-line
+diff_command_line: str = f"diff -r data actual --exclude temp --exclude {script_name}"
+
+test = DocSitePreviewTest(test_dir, feature_dir, script_name)
+# highlight-next-line
+test.verify(diff_command_line)
+```
+
+## 测试框架的使用
+
+### 目录结构
+
+这个项目的目录结构如下，其中功能脚本 `sync_scaffold.sh` 位于根目录下，`test` 为测试目录：
+
+```bash
+.
+├── sync_scaffold.sh
+└── test
+    ├── sync_scaffold
+    │   ├── actual
+    │   ├── data
+    │   └── test_sync_scaffold.py
+    └── test_util.py
+```
+
+### 测试新功能
+
+在这个项目中，基于 `DocSitePreviewTest` 对新功能进行测试只需要准备以下内容：
+
+1. 创建功能测试目录，例如 `test/sync_scaffold`
+2. 准备测试功能的预期输出数据 `test/sync_scaffold/data`
+3. 编写测试功能的 Python 脚本，例如 `test/sync_scaffold/test_sync_scaffold.py`
+
+   - 提供待测试脚本的名称 `script_name`
+   - 提供待测试脚本的参数 `script_args`
+   - 提供待测试脚本的环境变量 `script_env`
+   - 提供验证功能的 `diff` 命令 `diff_command_line`
+
+配置步骤如下图所示：
+
+![Test Process](/img/docsite_preview_test_process.png)
+
+### 示例
+
+下面是测试 `test_sync_scaffold.sh` 功能对应的 Python 脚本：
+
+```python title=test/sync_scaffold/test_sync_scaffold.py
+import sys
+import os
+
+test_dir: str = os.path.dirname(os.path.abspath(__file__))
+feature_dir: str = os.path.dirname(os.path.dirname(test_dir))
+# highlight-start
+sys.path.append(os.path.dirname(test_dir))
+
+from test_util import DocSitePreviewTest
+# highlight-end
+
+script_name: str = "sync_scaffold.sh"
+script_args: str = "main"
+diff_command_line: str = f"diff -r data actual --exclude temp --exclude {script_name}"
+
+script_env = os.environ.copy()
+script_env["TEST"] = str(1)
+
+test = DocSitePreviewTest(test_dir, feature_dir, script_name)
+
+test.execute(args=script_args, env=script_env)
+
+test.verify(diff_command_line)
+```
+
+为什么上面的代码示例中需要先 `sys.path.append(os.path.dirname(test_dir))` 再 `from test_util import DocSitePreviewTest`？
+
+如果在 `test/sync_scaffold/test_sync_scaffold.py` 开头直接 `from test_util import DocSitePreviewTest` 会发生什么？
+
+```bash
+python3 test/sync_scaffold/test_sync_scaffold.py
+Traceback (most recent call last):
+  File "test/sync_scaffold/test_sync_scaffold.py", line 3, in <module>
+    from test_util import DocSitePreviewTest
+ModuleNotFoundError: No module named 'test_util'
+```
+
+会报错 `ModuleNotFoundError: No module named 'test_util'`。
+
+这是因为 Python 解释器处理 `import` 语句时会在 `sys.path` 列表中的目录里查找这个模块。在执行 `test_sync_scaffold.py` 时，初始化的 `sys.path` 列表中会包含 `test/sync_scaffold` 目录，而 `test_util.py` 不在 `test/sync_scaffold` 目录中，所以会报错。
+
+更多信息请参考 [The initialization of the sys.path module search path](https://docs.python.org/3/library/sys_path_init.html)。
+
+:::tip
+
+为什么是 `sys.path`？
+
+> [By default, this package provides built-in support for discovery of metadata for file system and zip file Distribution Packages. **This metadata finder search defaults to `sys.path`**, but varies slightly in how it interprets those values from how other import machinery does.](https://docs.python.org/3/library/importlib.metadata.html#distribution-discovery)
+
+对应 `importlib.metadata` 的源码 [`importlib/metadata/__init__.py`](https://github.com/python/cpython/blob/3.11/Lib/importlib/metadata/__init__.py#L758-L766)。
+
+:::
+
+为了解决这个问题，需要将 `test_util.py` 所在的目录 `test` 添加到 `sys.path` 中。
+
+```python
+import sys
+import os
+
+print(sys.path)
+test_dir: str = os.path.dirname(os.path.abspath(__file__))
+feature_dir: str = os.path.dirname(os.path.dirname(test_dir))
+sys.path.append(os.path.dirname(test_dir))
+print(sys.path)
+```
+
+执行结果如下：
+
+```bash
+['/Projects/test/sync_scaffold', '/opt/homebrew/Cellar/[email protected]/...']
+['/Projects/test/sync_scaffold', '/opt/homebrew/Cellar/[email protected]/...', '/Projects/test']
+```
-Original file line number
+Diff line change
@@ -0,0 +1,3 @@
+    .idea
+    .vscode
+    .zed