[sword-devel] Character Frequency
    Peter von Kaehne 
    refdoc at gmx.net
       
    Fri Jul  8 03:06:53 MST 2011
    
    
  
On Fri, 2011-07-08 at 01:17 -0700, David Haslam wrote:
> For projects that begin at USFM (or earlier), it would be great to develop a
> tool that analyses character frequency of the text (for the whole Bible)
> apart from all the USFM tags, etc.
Done for USFM.
sword-tools/modules/misc_cleanup/usfm_charmap.pl
Anything build from XML (this includes files coming out of e.g. a styled
MS word document, once exported properly, e.g to abiword.xml) the
previously mentioned will do the job largely. Shortcomings there would
be verse and chapter numbers are usually part of the pain text.
Peter
    
    
More information about the sword-devel
mailing list