Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix running issues. #6

Open
wants to merge 14 commits into
base: master
Choose a base branch
from
Open

Conversation

w32zhong
Copy link

related to issue #5

@@ -23,7 +23,8 @@ def get_collection(self, ):
temp_address = root+"/"+directory+"/"
if not os.path.isdir(temp_address):
continue
temp_address = temp_address +"/Articles"
if os.path.exists(temp_address + "/Articles"):
Copy link

@norbertstrzelecki norbertstrzelecki Sep 10, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks like this one / might be too much here (or at the end of the temp_address = root+"/"+directory+"/" line) since together they convert to temp_address//Articles and there is an error when running python3 tangent_cft_front_end.py -ds "/NTCIR12_MathIR_WikiCorpus_v2.1.0/MathTagArticles" ...

```sh
for x in *.tar.bz2; do echo $x; tar xjf $x; done
```

Also one can easily use anydataset, such as [Math Stach Exchange] (https://math.stackexchange.com/), in form of csv file of latex formula and formula ids (separated by $$ sign) to train a new model.
Copy link

@norbertstrzelecki norbertstrzelecki Sep 13, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It could be helpfull to provide either a more specific link to the mentioned dataset or any example of such .csv file.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what I have mentioned is refering the link above: https://www.cs.rit.edu/~rlaz/NTCIR-12_MathIR_Wikipedia_Corpus.zip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants