RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
-
Updated
Jun 12, 2024 - Python
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
Docker Image with latest Tesseract OCR Version 5.x.x built from sources
node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
A bookshelf for your mokuro-scanned books with database storage
深度学习辅助漫画翻译工具, 支持一键机翻和简单的图像/文本编辑 | Yet another computer-aided comic/manga translation tool powered by deeplearning
extracts work schedules from screenshots and creates calendar events for them. uses tesseract ocr engine
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
img2txt Android app
The ultimate open-source RAG framework
OpenRecall is a fully open-source, privacy-first alternative to proprietary solutions like Microsoft's Windows Recall. With OpenRecall, you can easily access your digital history, enhancing your memory and productivity without compromising your privacy.
Vision utilities for web interaction agents 👀
Tesseract Open Source OCR Engine (main repository)
Shared services serves as a ready made solutions to most of the code snippets required for back-end services development in spring boot.
Leverage Deep Learning to digitize old Vietnamese handwritten for historical document archiving (Made with national pride in every single line of code): https://www.kaggle.com/datasets/quandang/nomnaocr
Ruby gem for communicating with the Veryfi OCR API.
Add a description, image, and links to the ocr topic page so that developers can more easily learn about it.
To associate your repository with the ocr topic, visit your repo's landing page and select "manage topics."