Commit Graph

1 Commits

Author SHA1 Message Date
b8b8817cd4 feat(notes): add guide for OCR and PDF manipulation on Linux
- Added instructions for setting up Tesseract with language support.
- Documented steps for converting PDFs to images using `pdftoppm` and alternatives like `ImageMagick`.
- Included examples for single and multi-page OCR processing.
- Detailed methods for merging extracted text into a single file.
- Added troubleshooting tips for improving OCR results and handling selectable PDFs with `pdftotext`.
2024-12-05 16:09:04 -05:00