Lower case tokenizer

The lower case tokenizer considers white spaces and non-letters as delimiters, splits the input string at these delimiters, and then discards all delimiters. Finally, it converts all letters to lowercase.

Factory class: solr.LowerCaseTokenizerFactory

Arguments: None

Example:

<fieldType name="text_en" class="solr.TextField" positionIncrementGap="100">
<analyzer>
<tokenizer class="solr.LowerCaseTokenizerFactory"/>
</analyzer>
</fieldType>

Input: Please send a mail at [email protected] by 12-Nov.

Outputpleasesendamailatdharmeshvasoyaexamplecombynov

The input string was first split at white spaces and punctuation and then converted to lowercase.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.119.106.135