Bug #1644

Testing Change: Modify the method of header variants generation using variations_in_common_section_words

Added by Nandini Bansal about 3 years ago. Updated about 3 years ago.

Status: Closed
Priority: Normal
Assignee: -
Target version:
Start date: 09/13/2021
Due date:
% Done: 0%
Estimated time: 4.00 h

Description

We need to change how variations_in_common_section_words generates header variants. The function uses "sect_common_words", a list of the 50 most common words in the dataset. In the current implementation, we check whether a header variant starts or ends with a word in "sect_common_words" and, if it does, we remove that word from the header variant to generate a new header variant.

Under the new testing modification, instead of checking only one word at the start and one at the end, we need to keep checking words from each end until we reach a word that is not in "sect_common_words". The leftover string (if any) will be added as a header variant.
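The modified behaviour could be sketched roughly as follows. This is only an illustration, not the actual code of variations_in_common_section_words: the helper name strip_common_words is hypothetical, and the whitespace tokenization and case-insensitive comparison are assumptions that the ticket does not specify.

```python
def strip_common_words(header, sect_common_words):
    """Drop leading and trailing words that appear in sect_common_words.

    Hypothetical helper: returns the leftover string, or None when
    nothing was stripped or nothing is left over.
    """
    # Assumption: matching is case-insensitive.
    common = {word.lower() for word in sect_common_words}
    # Assumption: a header variant is tokenized on whitespace.
    words = header.split()

    start, end = 0, len(words)
    # Keep skipping words from the start until a non-common word is found.
    while start < end and words[start].lower() in common:
        start += 1
    # Do the same from the end, without crossing the start index.
    while end > start and words[end - 1].lower() in common:
        end -= 1

    if start == 0 and end == len(words):
        return None  # no common word at either end, so no new variant

    leftover = " ".join(words[start:end])
    return leftover or None  # an empty leftover yields no variant
```

For example, with a made-up common-word list ["an", "introduction", "to", "the"], the header "An Introduction to the Lists" would leave "Lists" as the new variant, while a header made up only of common words would produce no variant.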

Testing with Whirlwind Book, Tutorial Book & Lib Ref Book

#1

Updated by Nandini Bansal about 3 years ago

  • Assignee set to Anonymous
#2

Updated by Anonymous about 3 years ago

  • Status changed from New to Resolved
#3

Updated by Nandini Bansal about 3 years ago

  • Status changed from Resolved to Closed
