feat: bump RAS format to 1.3 #247

deltork · 2024-10-10T13:52:21Z

PR Goal?

To fix an issue created by the editor

Fixes?

ReadAlongs/Studio-Web#347

Feedback sought?

sanity check

Priority?

High

Tests added?

yes

How to test?

Confidence?

High

Version change?

no

semanticdiff-com · 2024-10-10T13:52:24Z

Review changes with SemanticDiff.

Analyzed 6 of 10 files.

Overall, the semantic diff is 47% smaller than the GitHub diff.

	Filename	Status
✔️	test/test_dtd.py	Analyzed
❔	test/data/ras-dtd-1.1.readalong	Unsupported file format
❔	test/data/ras-dtd-1.3.readalong	Unsupported file format
✔️	readalongs/_version.py	61.26% smaller
✔️	readalongs/align.py	Analyzed
✔️	readalongs/web_api.py	Analyzed
✔️	readalongs/text/make_package.py	58.01% smaller
✔️	readalongs/text/util.py	11.36% smaller
❔	readalongs/static/read-along-1.3.dtd	Unsupported file format
❔	docs/cli-guide.md	Unsupported file format

codecov · 2024-10-10T14:48:50Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 87.12%. Comparing base (eec5662) to head (4cf9841).

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #247      +/-   ##
==========================================
- Coverage   87.51%   87.12%   -0.40%     
==========================================
  Files          21       21              
  Lines        1786     1786              
  Branches      323      323              
==========================================
- Hits         1563     1556       -7     
- Misses        185      191       +6     
- Partials       38       39       +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

joanise

Nice work, but requires a few changes.

For CI to pass on Windows (see https://github.com/ReadAlongs/Studio/actions/runs/11276460698/job/31360285739?pr=247) you also need to update test/data/cs-ref.readalong. I should change that test so it only looks at the line where not outputting utf-8 correctly would make a difference, instead of diffing the whole file...

More comments inline.

readalongs/static/read-along-1.3.dtd

joanise · 2024-10-10T18:36:49Z

readalongs/static/read-along-1.3.dtd

+  dur CDATA #IMPLIED
+  annotation-id CDATA #IMPLIED
+  sentence-id CDATA #IMPLIED
+  xmlns CDATA #IMPLIED>


I'm not sure why we need xmlns here. I get it for the top-level read-along element, and I would get it if every element needed to allow it, but only one additional element I don't get.

joanise · 2024-10-10T18:37:05Z

readalongs/static/read-along-1.3.dtd

+  id CDATA #IMPLIED
+  class CDATA #IMPLIED
+  do-not-align CDATA #IMPLIED
+  ARPABET CDATA #IMPLIED


can we make this case insensitive?

I was thinking the same. However, I am worried about backwards compatibility. Attributes in HTML are case insensitive but are case sensitive in XML (.readalong is a subset of XML). We need a good look at the whole pipeline before we make the switch. We use both XML and HTML parsers in the various parts of the pileline. As far as I know, only the CLIs consume the ARPABET attribute.

Maybe an option is to put both ARPABET and arpabet is the dtd?

joanise · 2024-10-10T18:37:11Z

readalongs/static/read-along-1.3.dtd

+  id CDATA #IMPLIED
+  class CDATA #IMPLIED
+  do-not-align CDATA #IMPLIED
+  ARPABET CDATA #IMPLIED


joanise · 2024-10-10T18:40:04Z

readalongs/text/make_package.py

@@ -20,13 +20,17 @@

 from lxml import etree

-from readalongs._version import VERSION
+from readalongs._version import CURRENT_WEB_APP_VERSION, VERSION


nice idea, moving CURRENT_WEB_APP_VERSION to _version.py and reusing it here.

feat: bump RAS format to 1.3

7f650f6

deltork linked an issue Oct 10, 2024 that may be closed by this pull request

DTD 1.3 #246

Open

deltork requested review from joanise and roedoejet October 10, 2024 13:52

fix: xmlns added to s tag ( RAS format to 1.3)

4cf9841

deltork mentioned this pull request Oct 10, 2024

fix: cannot download other formats ReadAlongs/Studio-Web#351

Merged

joanise requested changes Oct 10, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: bump RAS format to 1.3 #247

feat: bump RAS format to 1.3 #247

deltork commented Oct 10, 2024

semanticdiff-com bot commented Oct 10, 2024 •

edited

Loading

codecov bot commented Oct 10, 2024

joanise left a comment

joanise Oct 10, 2024

joanise Oct 10, 2024

deltork Oct 17, 2024

joanise Oct 17, 2024

joanise Oct 10, 2024

joanise Oct 10, 2024

feat: bump RAS format to 1.3 #247

Are you sure you want to change the base?

feat: bump RAS format to 1.3 #247

Conversation

deltork commented Oct 10, 2024

PR Goal?

Fixes?

Feedback sought?

Priority?

Tests added?

How to test?

Confidence?

Version change?

semanticdiff-com bot commented Oct 10, 2024 • edited Loading

codecov bot commented Oct 10, 2024

Codecov Report

joanise left a comment

Choose a reason for hiding this comment

joanise Oct 10, 2024

Choose a reason for hiding this comment

joanise Oct 10, 2024

Choose a reason for hiding this comment

deltork Oct 17, 2024

Choose a reason for hiding this comment

joanise Oct 17, 2024

Choose a reason for hiding this comment

joanise Oct 10, 2024

Choose a reason for hiding this comment

joanise Oct 10, 2024

Choose a reason for hiding this comment

semanticdiff-com bot commented Oct 10, 2024 •

edited

Loading