Hello,
for my specific needs, i need to make a small adjustment to the way the nearest neighbor - levensthein algorithm works. In a step of creating the clusters, the algorithm removes all punctuation and control characters. (In this step, i need to also remove some stop words: "&", "and", "co", "the" for example).
Where in the source code should I make this change?
Thank you in advance for your help
in the knn folder, there are many files, and i need to spot the place where the normalisation of the string happens, so as there i could also remove temporarily the stop words.