Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add check for images' alt text #2348

Closed
wants to merge 4 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions .github/workflows/main.yml
Original file line number Diff line number Diff line change
Expand Up @@ -28,6 +28,8 @@ jobs:

- name: File metadata
run: npm run check:metadata
- name: Check image alt text
run: npm run check:alt-text
- name: Spellcheck
run: npm run check:spelling
- name: Check Qiskit bot config
Expand Down
14 changes: 14 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -289,6 +289,20 @@ npm run check:metadata -- --apis
npm run check
```

## Check image alt text

Every image needs to have alt text for accessibility. The `lint` job in CI will fail if images do not have alt text defined.

You can check it locally by running:

```bash
# Only check alt text
npm run check:alt-text

# Or, run all the checks
npm run check
```

## Spellcheck

We use [cSpell](https://cspell.org) to check for spelling. The `lint` job in CI will fail if there are spelling issues.
Expand Down
4 changes: 2 additions & 2 deletions docs/guides/custom-transpiler-pass.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -72,7 +72,7 @@
" qc.draw(output='mpl')\n",
"\n",
"```\n",
"![The circuit's DAG consists of nodes that are connected by directional edges. It is a visual way to represent qubits or classical bits, the operations, and the way that data flows. ](/images/guides/custom-transpiler-pass/DAG_circ.png \"DAG\")\n",
"![Circuit preparing a Bell state and applying an $R_Z$ rotation depending on the measurement outcome.](/images/guides/custom-transpiler-pass/DAG_circ.png \"Circuit\")\n",
"\n",
"Use the `qiskit.tools.visualization.dag_drawer()` function to view this circuit's DAG. There are three kinds of graph nodes: qubit/clbit nodes (green), operation nodes (blue), and output nodes (red). Each edge indicates data flow (or dependency) between two nodes.\n",
"\n",
Expand All @@ -83,7 +83,7 @@
"dag = circuit_to_dag(qc)\n",
"dag_drawer(dag)\n",
"```\n",
"![](/images/guides/custom-transpiler-pass/DAG.png)\n",
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This image was missing the alt text. Looking at the notebook (https://docs.quantum.ibm.com/guides/custom-transpiler-pass) I think the alt text of the previous image (the circuit inside the "Background: DAG representation" collapsible) refereed to this one, so I moved it here and added an alt text for the circuit

"![The circuit's DAG consists of nodes that are connected by directional edges. It is a visual way to represent qubits or classical bits, the operations, and the way that data flows.](/images/guides/custom-transpiler-pass/DAG.png \"DAG\")\n",
"</details>"
]
},
Expand Down
3 changes: 2 additions & 1 deletion package.json
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,8 @@
"fmt": "prettier --write .",
"test": "playwright test",
"typecheck": "tsc",
"check": "npm run check:qiskit-bot && npm run check:patterns-index && npm run check:metadata && npm run check:spelling && npm run check:internal-links && npm run check:orphan-pages && npm run check:fmt",
"check": "npm run check:qiskit-bot && npm run check:patterns-index && npm run check:alt-text && npm run check:metadata && npm run check:spelling && npm run check:internal-links && npm run check:orphan-pages && npm run check:fmt",
"check:alt-text": "tsx scripts/js/commands/checkAltText.ts",
"check:metadata": "tsx scripts/js/commands/checkMetadata.ts",
"check:spelling": "tsx scripts/js/commands/checkSpelling.ts",
"check:fmt": "prettier --check .",
Expand Down
43 changes: 43 additions & 0 deletions scripts/js/commands/checkAltText.ts
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
// This code is a Qiskit project.
//
// (C) Copyright IBM 2024.
//
// This code is licensed under the Apache License, Version 2.0. You may
// obtain a copy of this license in the LICENSE file in the root directory
// of this source tree or at http://www.apache.org/licenses/LICENSE-2.0.
//
// Any modifications or derivative works of this code must retain this
// copyright notice, and modified files need to carry a notice indicating
// that they have been altered from the originals.

import { globby } from "globby";
import { findImagesWithoutAltText } from "../lib/extractMarkdownImages.js";
import { readMarkdown } from "../lib/markdownReader.js";

async function main() {
const files = await globby(["docs/**/*.{ipynb,mdx}", "!docs/api/**/*.mdx"]);
const fileErrors: string[] = [];

for (const file of files) {
const markdown = await readMarkdown(file);
const images = await findImagesWithoutAltText(markdown);
const imageErrors = [...images].map(
(image) => `The image '${image}' does not have an alt text defined.`,
);

if (imageErrors.length) {
fileErrors.push(
`Error in file '${file}':\n\t- ${imageErrors.join("\n\t- ")}`,
);
}
}

if (fileErrors.length) {
fileErrors.forEach((error) => console.log(error));
console.error("\nSome images are missing alt text 💔\n");
process.exit(1);
}
console.log("\nAll images have alt text ✅\n");
}

main().then(() => process.exit());
40 changes: 40 additions & 0 deletions scripts/js/lib/extractMarkdownImages.test.ts
Original file line number Diff line number Diff line change
@@ -0,0 +1,40 @@
// This code is a Qiskit project.
//
// (C) Copyright IBM 2023.
//
// This code is licensed under the Apache License, Version 2.0. You may
// obtain a copy of this license in the LICENSE file in the root directory
// of this source tree or at http://www.apache.org/licenses/LICENSE-2.0.
//
// Any modifications or derivative works of this code must retain this
// copyright notice, and modified files need to carry a notice indicating
// that they have been altered from the originals.

import { expect, test } from "@playwright/test";

import { findImagesWithoutAltText } from "./extractMarkdownImages.js";

test("Test the extraction of the images", async () => {
const markdown = `
# A header

![Our first image with alt text](/images/img1.png)

![](/images/invalid_img1.png)

![Here's another valid image](/images/img2.png)

![](/images/invalid_img2.png)

![](/images/invalid_img2.png)

![And now, our last link](https://ibm.com)
`;
const images = await findImagesWithoutAltText(markdown);
const correct_images = new Set([
"/images/invalid_img1.png",
"/images/invalid_img2.png",
]);

expect(images).toEqual(correct_images);
});
39 changes: 39 additions & 0 deletions scripts/js/lib/extractMarkdownImages.ts
Original file line number Diff line number Diff line change
@@ -0,0 +1,39 @@
// This code is a Qiskit project.
//
// (C) Copyright IBM 2024.
//
// This code is licensed under the Apache License, Version 2.0. You may
// obtain a copy of this license in the LICENSE file in the root directory
// of this source tree or at http://www.apache.org/licenses/LICENSE-2.0.
//
// Any modifications or derivative works of this code must retain this
// copyright notice, and modified files need to carry a notice indicating
// that they have been altered from the originals.

import { unified } from "unified";
import { Root } from "remark-mdx";
import { visit } from "unist-util-visit";
import remarkParse from "remark-parse";
import remarkGfm from "remark-gfm";
import remarkStringify from "remark-stringify";

export async function findImagesWithoutAltText(
markdown: string,
): Promise<Set<string>> {
const images = new Set<string>();

await unified()
.use(remarkParse)
.use(remarkGfm)
.use(() => (tree: Root) => {
visit(tree, "image", (node) => {
if (!node.alt) {
images.add(node.url);
}
});
Comment on lines +29 to +33
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FYI: In my opinion, the visitor is already very simple and readable, but I discovered that we can add more conditions to the typeNode part to make it more concise. Equivalent example:

 visit(
        tree,
        (node) => node.type === "image" && !node.alt,
        (node) => images.add(node.url),
      );

I don't think is useful right now, but we could use it in the future if the logic becomes more complex.

})
.use(remarkStringify)
.process(markdown);
Comment on lines +25 to +36
Copy link
Collaborator Author

@arnaucasau arnaucasau Nov 20, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Notice that the plugin will not extract images from HTML tags. I have searched in the repo and we don't have any instance of it. HTML tags can be used for the writers if they want to resize the images in markdown:

<img src="example.jpg" alt="Example" width="200"/>

We could implement this by adding another visitor that uses Cheerio as follows:

visit(tree, "html", (node) => {
  const $ = load(node.value);
  const imgElement = $("img");
  if (imgElement.length) {
    images.add(imgElement.attr("src"));
 }
});

I think it would be good to add it to make it more robust, but what do you think?


return images;
}
Loading