Feature #1815
Implement a new context algorithm for the KPs matching with <word1.word2> subsection headers
Start date:
10/28/2021
Due date:
% Done:
0%
Estimated time:
2.00 h
Description
In continuation of #1810
We need to implement a new context matching algorithm for the KPs matching with <word1.word2> subsection headers. We will make use of parent header information of the KPs and similar docs to match/check the context. The intuition is that KP doc and similar doc falling under the same group/hierarchy of the headers have a higher probability of being related to each other as compared to other similar docs.
Save the parent header info in the text file as well to identify patterns in the KP and similar doc.