[R] double metaphone for non-English languages

Dimitri Liakhovitski dimitri.liakhovitski at gmail.com
Thu Dec 8 18:17:47 CET 2016


I am processing some strings using package "PGRdup".
I using function 'DoubleMetaphone' to generate the double metaphone
codes of strings - to match words that sound alike (in case they were
written differently).

However, I got some French and German strings and discovered that
DoubleMetaphone can't process non-ASCII characters (like Umlauts in
German, accents in French).

What would you recommend I use instead?
Use 'phonetic' from stringdist?

Thank you!
Dimitri Liakhovitski

More information about the R-help mailing list