StanfordNLP sentence segmenter bug #152
Labels
bug
Something isn't working
priority: medium
topic: processors
Issues related to design and development of processors
While trying to find sentence boundaries, the technique to find the sentence ending can fail.
We are using
find
which gives the first occurrence of a word in a sentence. This will definitely fail when there are 2 duplicate words in a sentence.https://github.com/asyml/forte/blob/master/forte/processors/stanfordnlp_processor.py#L72
The text was updated successfully, but these errors were encountered: