Project

General

Profile

Task #1897

Skip the repeating KPs within 50 words according to new scheme

Added by Nandini Bansal about 3 years ago. Updated almost 3 years ago.

Status:
Resolved
Priority:
Normal
Target version:
Start date:
11/17/2021
Due date:
% Done:

0%

Estimated time:
2.50 h

Description

The repeating KPs within 50 words should be skipped as per the new scheme:

If multiple instances of the same KP are tagged within the threshold (50 words) like k1, k2, k3 (same order)
Case 1) if sk1 < sk2 < sk3, only k3 will be tagged
Case 2) if sk1 < sk2 = sk3, both k2 & k3 will be tagged
Case 3) if sk1 > sk2 >= sk3, only k1 will be tagged
Case 4) if sk1 < sk2 >= sk3, only k2 will be tagged

Note: skn = sim_score of kn

The case where sim_scores will be the same for all KPs will be handled in a different ticket. Changes to be done in "check_repetition_multi_occurence" function

Also available in: Atom PDF