Fast and accurate AI-powered file content type detection
Magika is a novel AI-powered file type detection tool that relies on
recent advances in deep learning to provide accurate detection. Under the hood,
Magika employs a custom, highly optimized model that weighs only a few
MB and enables precise file identification within milliseconds, even when
running on a single CPU. Magika has been trained and evaluated on a dataset of
~100M samples across 200+ content types (covering both binary and textual file
formats), and it achieves an average accuracy of ~99% on our test set.