Subject: RE: mailto: scheme aware tokenizer
From: Steven A Rowe (sar@syr.edu)
Date: Mar 18, 2012 8:09:53 am
List: org.apache.lucene.solr-user

Hi Kai,

I have created an issue for this:
https://issues.apache.org/jira/browse/LUCENE-3880

Thanks for reporting!

Steve

-----Original Message-----
From: Kai Gülzau [mailto:kgue@novomind.com]
Sent: Friday, March 16, 2012 9:59 AM
To: solr@lucene.apache.org
Subject: mailto: scheme aware tokenizer

Is there any analyzer out there which handles the mailto: scheme?

UAX29URLEmailTokenizer seems to split at the wrong place:

mailto:test@example.org -> mailto:test example.org

As a workaround I use

<charFilter class="solr.PatternReplaceCharFilterFactory" pattern="mailto:"
replacement="mailto: "/>
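
To illustrate what this char filter presumably does to the text before the tokenizer sees it, here is a minimal plain-Java sketch. It only uses java.util.regex with the same pattern and replacement as the configuration above; the class name MailtoWorkaroundSketch and the method applyWorkaround are hypothetical and not part of Solr or Lucene, and the sketch does not run the actual tokenizer.

```java
import java.util.regex.Pattern;

public class MailtoWorkaroundSketch {
    // Same regex and replacement as in the charFilter configuration above.
    private static final Pattern MAILTO = Pattern.compile("mailto:");
    private static final String REPLACEMENT = "mailto: ";

    // Hypothetical stand-in for the char filter: rewrites the raw text
    // so that a space separates the scheme from the address.
    static String applyWorkaround(String input) {
        return MAILTO.matcher(input).replaceAll(REPLACEMENT);
    }

    public static void main(String[] args) {
        // Prints "mailto: test@example.org"
        System.out.println(applyWorkaround("mailto:test@example.org"));
    }
}
```

With the scheme separated from the address by a space, the tokenizer should get the chance to see test@example.org as a standalone candidate for an email token.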

Regards,

Kai Gülzau

novomind AG
