Recover from various issues #34

Merged
merged 8 commits into from
Aug 14, 2014

LFDM
Member

@LFDM LFDM commented Aug 14, 2014

Closes #33

We run into problems when two documents' token structures are out of sync. This can happen even with valid documents, for example when Arethusa inserts elliptic nodes.

Partially unannotated documents also caused errors: attributes could be missing, and the postag could additionally be present but an empty string. This PR handles those cases as well.
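
To illustrate the kind of normalization described above, here is a minimal Python sketch, not the code from this PR. The attribute names in `ATTRIBUTES` and the function `normalize_token` are hypothetical, and treating every attribute the same way as the postag (missing or empty becomes a placeholder) is an assumption. The `'-'` placeholder is borrowed from the commit log of this PR.

```python
# Placeholder for "no value"; the commit log mentions '-' as the emptiness marker.
EMPTY = "-"

# Hypothetical attribute names, chosen for illustration only.
ATTRIBUTES = ("form", "lemma", "postag", "relation", "head")


def normalize_token(raw):
    """Return a dict in which every expected attribute is present and non-empty.

    `raw` maps attribute names to string values and may lack some keys.
    """
    token = {}
    for name in ATTRIBUTES:
        value = raw.get(name)
        # Treat a missing attribute and an empty (or whitespace-only) string alike.
        if value is None or not value.strip():
            value = EMPTY
        token[name] = value
    return token


# A partially annotated token: lemma and relation are missing, postag is empty.
partial = {"form": "arma", "postag": "", "head": "2"}
print(normalize_token(partial))
```

With a step like this in front of the comparison, later code can read any attribute without checking for absent keys or empty strings first.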

These minor changes keep the application running without throwing exceptions, but they come with some follow-up problems. A new issue will deal with them.

LFDM added 8 commits August 14, 2014 11:20
Documents might contain slightly out of sync token structures, when
artificial tokens were inserted in one but not the other.

To keep the program running, we enter something like an empty word,
which is effective, but can also be problematic in the final reports:
When a word is present only in the reviewable file, we cannot keep
track of it in the diff report.
... and put something that indicates emptiness ('-') there.
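
The empty-word approach from the commit messages can be sketched roughly as follows. This is an illustrative Python sketch under stated assumptions, not the implementation merged here: matching tokens by an `id` field, the name `gold` for the reference document, and the helper `align` are all hypothetical.

```python
# Stand-in for a word that one document has and the other lacks (hypothetical shape).
EMPTY_WORD = {"id": None, "form": "-", "head": "-", "relation": "-"}


def align(gold, reviewable):
    """Pair each gold token with the reviewable token that has the same id.

    Returns (pairs, untracked). A gold token without a counterpart is paired
    with EMPTY_WORD. Tokens that exist only in the reviewable document end up
    in `untracked`, mirroring the limitation noted in the commit message.
    """
    by_id = {tok["id"]: tok for tok in reviewable}
    pairs = [(tok, by_id.pop(tok["id"], EMPTY_WORD)) for tok in gold]
    # Whatever is left in by_id was never matched by a gold token.
    untracked = list(by_id.values())
    return pairs, untracked


# The gold document contains an inserted elliptic node (id 2) that the reviewable lacks;
# the reviewable document contains a token (id 4) that the gold lacks.
gold = [{"id": 1, "form": "arma"}, {"id": 2, "form": "[0]"}, {"id": 3, "form": "cano"}]
reviewable = [{"id": 1, "form": "arma"}, {"id": 3, "form": "cano"}, {"id": 4, "form": "que"}]

pairs, untracked = align(gold, reviewable)
for gold_tok, rev_tok in pairs:
    print(gold_tok["form"], "<->", rev_tok["form"])
print("untracked:", [tok["form"] for tok in untracked])
```

Returning the unmatched reviewable tokens separately makes the gap visible: the comparison keeps running, but those words would still have to be surfaced in the diff report by some other means.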
LFDM added a commit that referenced this pull request Aug 14, 2014
@LFDM LFDM merged commit 26ecae6 into master Aug 14, 2014
Development

Successfully merging this pull request may close these issues.

Make this more robust