You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
that XML file looks interesting and would contain all information for retrieving the article. However, I can't find it in the repository, there is just a file called german-comment-ids.txt.gz which contains the id only. You need either the link_id or the permalink for downloading a Reddit object.
We would like to rebuild the corpus, perform even more language detection and extend it.
This is a very interesting project.
Could you also provide the
link_id
in addition to theid
itself? This would allow construction of a valid Reddit URL for each comment.The text was updated successfully, but these errors were encountered: