Project

General

Profile

Actions

Defect #20730

closed

Fix tokenization of phrases with non-ascii chars

Added by Jens Krämer over 9 years ago. Updated about 9 years ago.

Status:
Closed
Priority:
Normal
Category:
Search engine
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:
Resolution:
Fixed
Affected version:

Description

\w only matches ASCII characters, we should either use [:alnum:] instead or simply match all non-" characters for the phrase. Test case included.


Files

Actions

Also available in: Atom PDF