Textcat ( http://www.let.rug.nl/~vannoord/TextCat/ ) can find out which language a text is in. I use it for choosing the correct languageanalyzer (Snowball) within Lucene indexing.