Hyphen library: What does "hyphenation vector" mean?


The Hyphen library seems to be a popular, free way to add hyphenation to an app.

What does the hyphenation vector mean?

I am running the example that ships with the library source code. Example output:

    hibernate     // input word
    030412000     // output hyphenation vector
    hi=ber=nate   // hyphen points
     - hi=bernate
     - hiber=nate

Odd numbers in the vector indicate hyphenation points. But what do the values themselves mean?

László Németh describes the algorithm in full detail in OpenOffice's documentation.

The library uses the algorithm developed by Frank M. Liang ("Word Hy-phen-a-tion by Com-put-er"): letters in digrams, trigrams, and longer patterns are assigned numerical values that indicate whether a position is a 'usual' place (an odd number) or an 'unusual' place (an even number) for a hyphen to occur. The higher the number, the greater its importance: where several patterns cover the same position, the largest number wins, so a word is never broken where the largest number is even, and is preferably broken where the largest number is odd. The number sequences are statistically determined from a corpus of pre-hyphenated words.
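To make the "largest number wins" rule concrete, here is a minimal Python sketch of Liang-style pattern matching. The pattern list is invented purely so that the result matches the vector from the question; the real patterns in the library's dictionary files are different. The function names are also hypothetical and are not part of the Hyphen API.

```python
import re

# Toy patterns, invented for illustration only (not taken from a real dictionary).
# A digit sits in the gap between the letters on either side of it.
TOY_PATTERNS = ["i3b", "e1r", "be4r", "r1n", "n2at"]

def parse_pattern(pattern):
    """Split a pattern such as 'be4r' into ('ber', [0, 0, 4, 0])."""
    letters = re.sub(r"\d", "", pattern)
    # values[k] is the number in the gap just before letters[k];
    # values[len(letters)] is the gap after the last letter.
    values = [0] * (len(letters) + 1)
    pos = 0
    for ch in pattern:
        if ch.isdigit():
            values[pos] = int(ch)
        else:
            pos += 1
    return letters, values

def gap_values(word, patterns):
    """Return one number per gap between adjacent letters of word."""
    table = dict(parse_pattern(p) for p in patterns)
    padded = "." + word + "."          # '.' marks the word boundaries
    gaps = [0] * (len(padded) + 1)     # gaps[k] is the gap before padded[k]
    for start in range(len(padded)):
        for end in range(start + 1, len(padded) + 1):
            values = table.get(padded[start:end])
            if values is not None:
                for offset, value in enumerate(values):
                    # Where patterns overlap, the larger number wins.
                    gaps[start + offset] = max(gaps[start + offset], value)
    # Keep only the gaps strictly inside the word.
    return gaps[2:len(word) + 1]

print(gap_values("hibernate", TOY_PATTERNS))  # [0, 3, 0, 4, 1, 2, 0, 0]
```

In this toy set, "e1r" on its own would allow a break between "e" and "r", but the longer pattern "be4r" overrides it with a larger even number, so no break is allowed there.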

Note that the numbers refer to the positions between two characters. A better notation would have been

    h   i   b   e   r   n   a   t   e
      0   3   0   4   1   2   0   0  (0)

(where the last 0 is redundant, since nothing follows the final letter).
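Reading the vector this way can be sketched in a few lines of Python. This is only an illustration of the interpretation described above, not the library's own code; the function names are made up for the example.

```python
def hyphen_points(word, vector):
    """Return each index i where a break is allowed after word[i].

    The digit at index i describes the gap after word[i]; the final
    digit is ignored because no gap follows the last letter.
    """
    usable = vector[:len(word) - 1]
    return [i for i, digit in enumerate(usable) if int(digit) % 2 == 1]

def mark_hyphens(word, vector, sep="="):
    """Insert sep at every gap whose number is odd."""
    points = set(hyphen_points(word, vector))
    pieces = []
    for i, ch in enumerate(word):
        pieces.append(ch)
        if i in points:
            pieces.append(sep)
    return "".join(pieces)

print(hyphen_points("hibernate", "030412000"))  # [1, 4]
print(mark_hyphens("hibernate", "030412000"))   # hi=ber=nate
```

The even values 4 and 2 do not produce breaks: they only record that a break at that gap was suppressed, and how strongly.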

