

17 messages in org.apache.lucene.java-user: Re: Wikia search goes live today

| From | Sent On | Attachments |
|---|---|---|
| Lukas Vlcek | Jan 7, 2008 4:48 am | |
| Grant Ingersoll | Jan 7, 2008 5:13 am | |
| Grant Ingersoll | Jan 7, 2008 8:21 am | |
| Otis Gospodnetic | Jan 7, 2008 2:14 pm | |
| Lukas Vlcek | Jan 7, 2008 11:48 pm | |
| Lukas Vlcek | Jan 7, 2008 11:54 pm | |
| Grant Ingersoll | Jan 8, 2008 4:46 am | |
| Mike Klaas | Jan 8, 2008 11:59 am | |
| Dennis Kubes | Jan 8, 2008 12:09 pm | |
| Michael Stoppelman | Jan 8, 2008 12:11 pm | |
| Lukas Vlcek | Jan 8, 2008 12:15 pm | |
| Andrzej Bialecki | Jan 8, 2008 12:23 pm | |
| Ryan McKinley | Jan 8, 2008 12:31 pm | |
| Lukas Vlcek | Jan 8, 2008 12:36 pm | |
| Lukas Vlcek | Jan 8, 2008 12:38 pm | |
| Andrzej Bialecki | Jan 8, 2008 2:23 pm | |
| Dennis Kubes | Jan 8, 2008 2:53 pm | |

| Subject: | Re: Wikia search goes live today | |
|---|---|---|
| From: | Lukas Vlcek (luka...@gmail.com) | |
| Date: | Jan 8, 2008 12:38:11 pm | |
| List: | org.apache.lucene.java-user | |
I should note that this technique is probably not easily applicable to the current Lucene scoring mechanism without additional development.
On 1/8/08, Lukas Vlcek <luka...@gmail.com> wrote:
After checking the Lucene API for ParallelReader, it seems that the star score could be stored in a different index that shares the same identifiers for the documents. Such an index could be small (partitioned into many small indices?), so the updates can be fast. Is that what you meant, Andrzej? ;-)
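To make the idea concrete, here is a minimal conceptual sketch in plain Java. It does not use Lucene or the ParallelReader API at all: two in-memory maps stand in for a large, rarely rebuilt main index and a small, frequently updated star index that share document ids. The class name, the method names, and the `log1p` boost formula are all assumptions made up for illustration.

```java
import java.util.HashMap;
import java.util.Map;

/** Conceptual sketch only: plain maps stand in for two indexes sharing document ids. */
public class StarOverlaySketch {
    // Stand-in for the large main index: doc id -> text relevance score for some query.
    private final Map<Integer, Double> textScores = new HashMap<>();
    // Stand-in for the small star index: doc id -> number of stars given by users.
    private final Map<Integer, Integer> stars = new HashMap<>();

    /** Expensive path in a real system: adds a document to the main index. */
    public void indexDocument(int docId, double textScore) {
        textScores.put(docId, textScore);
    }

    /** Cheap path: only the small star structure is touched, the main index is not rebuilt. */
    public void addStar(int docId) {
        stars.merge(docId, 1, Integer::sum);
    }

    /** Hypothetical combination: the text score is boosted by 1 + ln(1 + stars). */
    public double score(int docId) {
        double base = textScores.getOrDefault(docId, 0.0);
        int starCount = stars.getOrDefault(docId, 0);
        return base * (1.0 + Math.log1p(starCount));
    }

    public static void main(String[] args) {
        StarOverlaySketch index = new StarOverlaySketch();
        index.indexDocument(1, 2.0);
        index.indexDocument(2, 2.5);
        System.out.println("before starring: doc1=" + index.score(1) + " doc2=" + index.score(2));
        index.addStar(1); // takes effect on the very next score() call
        System.out.println("after starring:  doc1=" + index.score(1) + " doc2=" + index.score(2));
    }
}
```

The point of the split is that a star affects the score on the next query without any reindexing of the main content. How well this maps onto Lucene's actual scoring is a separate question.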
Anyway, I remember a different technique that I once mentioned on the Lucene mailing list, inspired by the book Programming Collective Intelligence <http://www.oreilly.com/catalog/9780596529321/>. The idea is to store the score (maybe I should call it user preference) not in the index but in a neural net. One useful side effect is that this technique could reasonably score even documents without any stars (meaning documents "similar" to highly starred documents could score better even if they haven't been starred by any user yet).
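A rough sketch of that side effect, under heavy assumptions: the code below is not the network from the book and not anything in Lucene. It swaps in a single logistic unit over document terms, trained on starred (label 1) and shown-but-unstarred (label 0) documents. The class name, the term lists, and the learning rate are all hypothetical.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Sketch: one logistic unit that learns which terms tend to appear in starred documents. */
public class StarPreferenceSketch {
    private final Map<String, Double> weights = new HashMap<>();
    private double bias = 0.0;

    private static double sigmoid(double x) {
        return 1.0 / (1.0 + Math.exp(-x));
    }

    /** Estimated preference in (0, 1) for a document given its terms. */
    public double predict(List<String> terms) {
        double sum = bias;
        for (String term : terms) {
            sum += weights.getOrDefault(term, 0.0);
        }
        return sigmoid(sum);
    }

    /** One gradient step on the log loss; label is 1.0 for starred, 0.0 for not starred. */
    public void train(List<String> terms, double label, double learningRate) {
        double error = label - predict(terms);
        bias += learningRate * error;
        for (String term : terms) {
            weights.merge(term, learningRate * error, Double::sum);
        }
    }

    public static void main(String[] args) {
        StarPreferenceSketch model = new StarPreferenceSketch();
        for (int epoch = 0; epoch < 200; epoch++) {
            model.train(Arrays.asList("lucene", "index", "search"), 1.0, 0.1);
            model.train(Arrays.asList("lucene", "scoring"), 1.0, 0.1);
            model.train(Arrays.asList("cooking", "recipes"), 0.0, 0.1);
            model.train(Arrays.asList("travel", "recipes"), 0.0, 0.1);
        }
        // Neither document below was ever starred or shown during training.
        System.out.println(model.predict(Arrays.asList("lucene", "tutorial")));
        System.out.println(model.predict(Arrays.asList("cooking", "tips")));
    }
}
```

A document that shares terms with starred documents ends up with a higher predicted preference than one that shares terms with unstarred documents, even though nobody has starred either of them.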
Regards, Lukas
On 1/8/08, Andrzej Bialecki <ab...@getopt.org> wrote:
Lukas Vlcek wrote:
So starring will be accommodated only during the indexing phase. Does that mean it will be a fairly static value, not a dynamically changing variable... correct? In other words, if I add my stars to some document, it won't affect the scoring immediately but only after an indexing cycle. Correct?
(I'm not involved in Wikia development). There are some ways to go about it even in the pure Lucene-land, so that the updates are fast without reindexing the main content. Hint: ParallelReader.
-- Best regards, Andrzej Bialecki <>< Information Retrieval, Semantic Web, Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com







