[sword-devel] Does the CLucene indexing work for non-English texts?
    DM Smith 
    dmsmith at crosswire.org
       
    Thu Nov  1 14:42:09 MST 2018
    
    
  
From memory, SWORD uses SimpleAnalyzer. This analyzer works well for Western European languages. It won’t work well for texts in non-Latin scripts, though it may work in part.
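To illustrate why a letter-based analyzer can struggle with text that has no spaces, here is a minimal Python sketch. It is a toy stand-in, not CLucene's or SWORD's code: `simple_analyze` is a hypothetical function that lowercases the input and splits wherever a character is not a letter, which is roughly the idea behind a simple analyzer. The real analyzer may behave differently in detail.

```python
def simple_analyze(text):
    """Toy analyzer (an assumption, not CLucene code):
    lowercase the text, then split wherever a character is not a letter."""
    tokens, current = [], []
    for ch in text.lower():
        if ch.isalpha():
            current.append(ch)
        elif current:
            # A non-letter ends the current token.
            tokens.append("".join(current))
            current = []
    if current:
        tokens.append("".join(current))
    return tokens

# English words are separated by spaces, so each word becomes a token.
english = simple_analyze("In the beginning was the Word")

# Chinese has no spaces and every character counts as a letter,
# so the whole phrase ends up as one token.
chinese = simple_analyze("太初有道")
```

With this toy analyzer the English sentence yields one token per word, while the Chinese phrase yields a single token, so a search for one word inside it would not match anything in the index.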
The basic rule of thumb is that the index has to be created with an analyzer, and the search request has to be analyzed with that same analyzer.
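A small sketch of that rule, in Python rather than CLucene, using a toy inverted index. All names here (`lowercase_analyzer`, `exact_analyzer`, `build_index`, `search`) are made up for illustration and are not part of SWORD or CLucene. The point is only that a query analyzed differently from the index produces tokens the index does not contain.

```python
from collections import defaultdict

def lowercase_analyzer(text):
    """Hypothetical analyzer: lowercase, split on whitespace."""
    return text.lower().split()

def exact_analyzer(text):
    """Hypothetical analyzer: split on whitespace, keep case as-is."""
    return text.split()

def build_index(docs, analyzer):
    """Map each token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in analyzer(text):
            index[token].add(doc_id)
    return index

def search(index, query, analyzer):
    """Return ids of documents containing every token of the analyzed query."""
    hits = None
    for token in analyzer(query):
        postings = index.get(token, set())
        hits = postings if hits is None else hits & postings
    return hits or set()

docs = {"John 1:1": "In the beginning was the Word"}
index = build_index(docs, lowercase_analyzer)

# Same analyzer on both sides: the query token "word" is in the index.
matched = search(index, "Word", lowercase_analyzer)

# Different analyzer at query time: the token "Word" was never indexed.
mismatched = search(index, "Word", exact_analyzer)
```

Here `matched` contains the verse and `mismatched` is empty, even though the index itself was built without any error.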
PocketSword uses externally created indexes, which must be downloaded before searching will work. It uses the SWORD library both to create those indexes and to search them.
In Him,
	DM
> On Nov 1, 2018, at 4:14 PM, TS <outofthecube at icloud.com> wrote:
> 
> Does the CLucene indexing work for non-English texts?
> 
> David's recent question about languages without spaces made me a bit curious about this matter. Briefly looking at the current Apache Lucene code, there appears to be extra code for non-English text. However, this is in comparison to the CLucene code for PocketSword. And I seem to recall that, in general, the CLucene code in PocketSword may only be for reading indices and not for writing them. Also, to clarify further, is it possible that an index is created, but with errors?
>     For example, when I search Koine Greek texts, the search gives erroneous results. I think that I'll start a separate post about Greek search and indexing specifically.
> 
> -TS
> 
> --Sent from phone--
    
    