The lower case tokenizer treats every non-letter character (whitespace, punctuation, and digits) as a delimiter. It splits the input string at these delimiters, discards them, and converts the remaining letter-only tokens to lowercase.
Factory class: solr.LowerCaseTokenizerFactory
Arguments: None
Example:
<fieldType name="text_en" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.LowerCaseTokenizerFactory"/>
  </analyzer>
</fieldType>
Input: Please send a mail at [email protected] by 12-Nov.
Output: please, send, a, mail, at, dharmesh, vasoya, example, com, by, nov
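The input string was split at every non-letter character, including whitespace, punctuation, and digits, and the resulting tokens were converted to lowercase. The 12 in 12-Nov consists only of digits, so it produces no token.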
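The splitting rule can be tried outside Solr with a small emulation. The following is only a sketch in plain Python, not Solr or Lucene code: the function name lower_case_tokenize and the sample sentence are made up for illustration, and it assumes Python's str.isalpha() is a close enough stand-in for the tokenizer's notion of a letter. It ignores implementation details such as token offsets and token length limits.

```python
def lower_case_tokenize(text: str) -> list[str]:
    """Emulate the behavior described above: split at non-letters, lowercase the rest.

    Hypothetical helper for illustration only; it does not call Solr or Lucene.
    """
    tokens = []
    current = []  # letters of the token currently being built
    for ch in text:
        if ch.isalpha():
            # Letters extend the current token, lowercased as they are added.
            current.append(ch.lower())
        elif current:
            # Any non-letter ends the current token and is itself discarded.
            tokens.append("".join(current))
            current = []
    if current:
        # Flush a token that runs to the end of the input.
        tokens.append("".join(current))
    return tokens


if __name__ == "__main__":
    # The sample sentence is an arbitrary example, not taken from the text above.
    print(lower_case_tokenize("Please call Mr. O'Neil by 12-Nov."))
    # prints ['please', 'call', 'mr', 'o', 'neil', 'by', 'nov']
```

Note how the apostrophe splits O'Neil into two tokens and how 12 disappears entirely. This is why the tokenizer is a poor fit for fields where numbers, email addresses, or product codes must remain searchable.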