-
Notifications
You must be signed in to change notification settings - Fork 139
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Trying to understand cparents.txt in Constituency parsing #23
Comments
Another example: a.txt -> Two dogs are playing by a tree |
For N tokens, you'll obtain a binary tree of 2N-1 nodes. |
First assign numbers from 0 to length-1 to the initial sentence in a.txt. These number stands for indices of leaf nodes. Write cparents as following, the numbers here stands for the indices of a node, -1 stands for the root node: |
Thank you very much !!!!!! |
|
@venusafroid |
I have downloaded the SICK data and obtained the dependency and constituency parsing with the fetch_and_preprocess.sh file.
I am now trying to understand what is the information that is generated in the cparents.txt file.
This is an example:
a.txt -> Two dogs are fighting
a.cparents.txt -> 5 5 7 7 6 0 6
If I am not mistaken, from the cparents.txt I should be able to build the parse tree. Is that right? And how would the tree for this example look like?
Thanks for any help in advance
The text was updated successfully, but these errors were encountered: