Fix running issues. #6
base: master
@@ -23,7 +23,8 @@ def get_collection(self, ):
temp_address = root+"/"+directory+"/"
if not os.path.isdir(temp_address):
continue
temp_address = temp_address +"/Articles"
if os.path.exists(temp_address + "/Articles"):
It looks like the leading `/` in `"/Articles"` is redundant here (or, alternatively, the trailing one at the end of the `temp_address = root+"/"+directory+"/"` line): together they produce `temp_address//Articles`, and there is an error when running `python3 tangent_cft_front_end.py -ds "/NTCIR12_MathIR_WikiCorpus_v2.1.0/MathTagArticles" ...`
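
One way to avoid the doubled separator is to build the path with `os.path.join` instead of string concatenation. The following is only a sketch, assuming `root` and `directory` are plain strings as in the hunk above; the helper name `articles_dir` is hypothetical and not part of the repository.

```python
import os


def articles_dir(root, directory):
    """Return the Articles folder under root/directory, or None if it is missing.

    Hypothetical helper for illustration only; not part of the repository.
    """
    # os.path.join inserts exactly one separator between components,
    # so no "//" can appear from concatenating "/" + "/Articles".
    base = os.path.join(root, directory)
    if not os.path.isdir(base):
        return None
    candidate = os.path.join(base, "Articles")
    return candidate if os.path.exists(candidate) else None
```

Inside the loop, a `None` result would correspond to the existing `continue` branch. Dropping one of the two slashes, as suggested above, would also work; joining simply avoids having to track which side supplies the separator.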
```sh
for x in *.tar.bz2; do echo $x; tar xjf $x; done
```

Also one can easily use anydataset, such as [Math Stach Exchange] (https://math.stackexchange.com/), in form of csv file of latex formula and formula ids (separated by $$ sign) to train a new model.
It could be helpful to provide either a more specific link to the mentioned dataset or an example of such a `.csv` file.

What I mentioned refers to the link above: https://www.cs.rit.edu/~rlaz/NTCIR-12_MathIR_Wikipedia_Corpus.zip
Related to issue #5.