The project is hosted by the Apache Software Foundation. It detects a wide range of file and content types, and a full list of supported formats is available. Many document formats appear on that list, e.g. text/xml, the proprietary Microsoft OOXML, or the OpenDocument office standard. Tika can also recognize images (image/tiff), videos (video/mp4), and audio (audio/mpeg). Even feeds (application/atom+xml) can be recognized. And many, many more …
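To give a feel for what content type detection involves, here is a minimal sketch of signature-based ("magic byte") sniffing for a few of the types mentioned above. It is not Tika's own code or API: the class name MagicSniffer and its methods are hypothetical, and the signature table is deliberately tiny.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical illustration of magic-byte detection; not part of Tika.
public class MagicSniffer {

    // True if `data` contains `signature` starting at `offset`.
    private static boolean hasBytesAt(byte[] data, int offset, byte[] signature) {
        if (data.length < offset + signature.length) {
            return false;
        }
        for (int i = 0; i < signature.length; i++) {
            if (data[offset + i] != signature[i]) {
                return false;
            }
        }
        return true;
    }

    private static byte[] ascii(String s) {
        return s.getBytes(StandardCharsets.US_ASCII);
    }

    // Maps the leading bytes of a file to a media type, or to the
    // generic application/octet-stream when no signature matches.
    public static String detect(byte[] data) {
        // TIFF starts with "II*\0" (little-endian) or "MM\0*" (big-endian).
        if (hasBytesAt(data, 0, new byte[] {0x49, 0x49, 0x2A, 0x00})
                || hasBytesAt(data, 0, new byte[] {0x4D, 0x4D, 0x00, 0x2A})) {
            return "image/tiff";
        }
        // ISO base media files carry "ftyp" at offset 4. Other formats
        // share this marker; it is treated as MP4 here for simplicity.
        if (hasBytesAt(data, 4, ascii("ftyp"))) {
            return "video/mp4";
        }
        // MP3 files that begin with an ID3v2 tag.
        if (hasBytesAt(data, 0, ascii("ID3"))) {
            return "audio/mpeg";
        }
        // XML documents that begin with an XML declaration.
        if (hasBytesAt(data, 0, ascii("<?xml"))) {
            return "text/xml";
        }
        return "application/octet-stream";
    }

    public static void main(String[] args) {
        byte[] tiff = {0x49, 0x49, 0x2A, 0x00, 0x08, 0x00, 0x00, 0x00};
        System.out.println(detect(tiff));
        System.out.println(detect(ascii("<?xml version=\"1.0\"?><feed/>")));
    }
}
```

Note that this sketch reports an Atom feed as plain text/xml. Telling application/atom+xml apart from other XML would require looking at the root element, which is one reason a real detector needs more than a fixed signature table.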