Feature #1809
Checking the context of the KP matched with "word1.word2" header variant
28%
Description
We have generally observed that when a KP matches a "word1.word2" header variant and the KP is equivalent to "word2", the link between the KP and the similar doc is out of context in most places, because these subsection headers are quite specific to the documents they are part of.
E.g. KP = method
header variant1 = class.method
header variant2 = request.method
The KP is linked to both "header variant1" and "header variant2" with high confidence scores, but the links might or might not be relevant, because a variant such as "header variant2" is very specific to the topic it belongs to.
To test this idea, we can fetch the sentence from which the KP was extracted and check whether "word1" is present in it. If "word1" is an abbreviation, check for both "word1" and the full form of "word1" in the context sentence.
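A minimal sketch of the proposed check, for discussion only. The function name `qualifier_in_context`, its signature, and the `ABBREVIATIONS` table are assumptions made for illustration; the ticket does not say how full forms are looked up or how this would plug into generate_candidates.

```python
import re

# Hypothetical abbreviation table: the ticket does not specify where
# full forms of "word1" come from, so this mapping is an assumption.
ABBREVIATIONS = {"ast": "abstract syntax tree"}


def qualifier_in_context(header_variant, kp, sentence, abbreviations=ABBREVIATIONS):
    """Return True if the KP -> header variant link should be kept.

    Only "word1.word2" variants whose "word2" equals the KP are checked;
    every other variant is passed through unchanged.
    """
    word1, sep, word2 = header_variant.rpartition(".")
    if not sep or word2.lower() != kp.lower():
        return True  # not the "word1.word2" case described in this ticket

    # Accept either "word1" itself or, if it is an abbreviation, its full form.
    forms = [word1.lower()]
    full_form = abbreviations.get(word1.lower())
    if full_form:
        forms.append(full_form.lower())

    text = sentence.lower()
    # Whole-word match, so that "ast" does not match inside "last".
    return any(re.search(r"\b" + re.escape(form) + r"\b", text) for form in forms)
```

With the context sentence "Use pickle to dump the object to a file.", the link to "pickle.dump" would be kept and the link to "ast.dump" dropped. Whether a failed check should drop the candidate or only lower its confidence score is left open here.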
Added a screenshot of the KP "dump" being matched to the "pickle.dump", "ast.dump", etc. subsections with high confidence, although they are out of context due to their specific nature.
Implementation: in the generate_candidates function.
It is very important to check this against the C-API book and the Library Reference book.