Bug #1743

Task #1726: Handling cases of bad header variants like "representation"

Checking singular and plural forms of the tmp_var from variations_in_common_section_words in common words list

Added by Nandini Bansal about 3 years ago. Updated about 3 years ago.

Status: Resolved
Priority: Normal
Assignee: -
Target version:
Start date: 10/13/2021
Due date:
% Done: 0%
Estimated time: 1.50 h (Total: 3.50 h)

Description

This is an experimental extension of the parent task. Earlier, we checked only tmp_var itself against the 4K CW list. We should now also check the singular and plural forms of tmp_var against the 4K CW list, to increase coverage and discard as many bad header variants as possible.
Based on the resulting list of header variants, we should do the following:
1. Verify the standalone KPs generated by these header variants and check the context in which each KP appears
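
The singular/plural check described above could be sketched in Python as follows. This is only an illustration under stated assumptions: the helper names singular_plural_forms and matches_common_word are hypothetical, the inflection rules are naive suffix rules rather than a real morphological analyzer, the small common_words set stands in for the 4K CW list, and it is assumed that a match marks the header variant as a candidate for removal.

```python
from typing import Set


def singular_plural_forms(word: str) -> Set[str]:
    """Return the word plus naive singular and plural candidates.

    Hypothetical helper: simple English suffix rules only, so some
    candidates will not be real words. That is harmless here because
    they are only used for lookup in the common words list.
    """
    w = word.lower().strip()
    forms = {w}

    # Plural candidates.
    if w.endswith(("s", "x", "z", "ch", "sh")):
        forms.add(w + "es")
    elif w.endswith("y") and len(w) > 1 and w[-2] not in "aeiou":
        forms.add(w[:-1] + "ies")
    else:
        forms.add(w + "s")

    # Singular candidates.
    if w.endswith("ies") and len(w) > 3:
        forms.add(w[:-3] + "y")
    if w.endswith("es") and len(w) > 2:
        forms.add(w[:-2])
    if w.endswith("s") and not w.endswith("ss") and len(w) > 1:
        forms.add(w[:-1])

    return forms


def matches_common_word(tmp_var: str, common_words: Set[str]) -> bool:
    """True if tmp_var, or a singular/plural form of it, is a common word.

    Assumption: a match means the header variant should be flagged
    as a bad variant and considered for removal.
    """
    return not singular_plural_forms(tmp_var).isdisjoint(common_words)


# Stand-in for the 4K CW list (assumed to be lowercase single words).
common_words = {"representation", "result", "study"}

for tmp_var in ["representations", "results", "studies", "method"]:
    print(tmp_var, matches_common_word(tmp_var, common_words))
```

With the earlier exact-match check, "representations" would not be found in a list that contains only "representation"; the form-based check catches it. A proper lemmatizer could replace the suffix rules if irregular plurals matter.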


Subtasks

Bug #1744: Calculating the fullness_ratio of the header variants to decide a threshold for removal of header variants (Resolved, 10/13/2021)
