OCRing Music from YouTube with Common Lisp: Difference between revisions

no edit summary
No edit summary
No edit summary
Line 11: Line 11:
= Attempt 1: Tesseract =
= Attempt 1: Tesseract =


Of all the classical OCR libraries out there, Tesseract is probably the most famous. There are a few knobs to tweak on it, but in general you just chuck your image at it and let it rip. I honestly figured this would work really well, since this is monospaced, easily readable characters that should theoretically be a perfect match for these old-skool OCR techniques. There's a Lisp binding out there for it: https://github.com/GOFAI/cl-tesseract, so I quickly grabbed it, pointed it at a sample image, and...
Of all the classical OCR libraries out there, Tesseract is probably the most famous. There are a few knobs to tweak on it, but in general you just chuck your image at it and let it rip. I honestly figured this would work really well, since this is monospaced, easily readable characters that should theoretically be a perfect match for these old-skool OCR techniques. There's a [https://github.com/GOFAI/cl-tesseract Lisp binding out there for it], so I quickly grabbed it, pointed it at a sample image, and...


[[File:Article2.png|600px]]
[[File:Article2.png|600px]]