Feature #1708
For KPs located very close together, pick the most similar one, and add a lemmatisation wrapper to handle exception cases
Description
In the screenshot attached below, "iteration", "iterable" and "iterator" are tagged very close to one another. This happens because the lemmatized forms of these KPs are not the same. The following code changes are needed to prevent it:
1) Add a wrapper around lemmatization so that cases like these are handled explicitly. This can be added in common_tagging_functions.py
2) When KPs are located very close together, pick only the most similar one among them for tagging.
You can also refer to the case of "modules" in URL https://edutestdev-240612.appspot.com/document/python-3-tutorial-pl-2021-10-04-105027.800623-spl/python?documentURL=10014%2Fprd%2F3%2Ftutorial%2Fmodules--Packages-in-Multiple-Directories-53.html
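The two changes above could be sketched roughly as follows. This is only an illustration, not the project's code: the exception table, the stand-in base lemmatizer, the (key phrase, matched text, token position) match format, the window size and the use of difflib for similarity are all assumptions, since the ticket does not specify how "close" or "most similar" are measured.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, List, Tuple

# Hypothetical exception table: forms that a generic lemmatizer keeps
# distinct but that should share one lemma for tagging purposes.
LEMMA_EXCEPTIONS: Dict[str, str] = {
    "iteration": "iterate",
    "iterable": "iterate",
    "iterator": "iterate",
}


def naive_lemmatize(word: str) -> str:
    """Stand-in for the real lemmatizer (assumption): strips a plural 's'."""
    if len(word) > 3 and word.endswith("s") and not word.endswith("ss"):
        return word[:-1]
    return word


def lemmatize_with_exceptions(
    word: str, base: Callable[[str], str] = naive_lemmatize
) -> str:
    """Wrapper: check the exception table before and after the base lemmatizer."""
    key = word.lower()
    if key in LEMMA_EXCEPTIONS:
        return LEMMA_EXCEPTIONS[key]
    lemma = base(key)
    return LEMMA_EXCEPTIONS.get(lemma, lemma)


# Assumed match format: (key phrase, matched text, token position).
Match = Tuple[str, str, int]


def similarity(a: str, b: str) -> float:
    """String similarity in [0, 1]; difflib is an assumed choice of metric."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def pick_most_similar(matches: List[Match], window: int = 5) -> List[Match]:
    """Group matches whose neighbours are at most `window` tokens apart and
    keep, per group, the match whose KP is most similar to the matched text."""
    kept: List[Match] = []
    cluster: List[Match] = []
    for match in sorted(matches, key=lambda m: m[2]):
        # A gap larger than the window closes the current cluster.
        if cluster and match[2] - cluster[-1][2] > window:
            kept.append(max(cluster, key=lambda c: similarity(c[0], c[1])))
            cluster = []
        cluster.append(match)
    if cluster:
        kept.append(max(cluster, key=lambda c: similarity(c[0], c[1])))
    return kept


matches = [
    ("iteration", "iterator", 10),
    ("iterator", "iterator", 12),
    ("module", "modules", 40),
]
selected = pick_most_similar(matches)
```

With the wrapper, "iteration", "iterable" and "iterator" all resolve to the same lemma, and the selection step leaves a single tag per cluster of nearby matches. If ties matter, the max() call keeps the earliest match in the cluster, which may or may not be the desired rule.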
Files
Updated by Nandini Bansal about 3 years ago
- Assignee set to Nandini Bansal
- % Done changed from 0 to 100
- Estimated time set to 1.00 h