Total Pageviews

Thursday, 14 December 2023

WeOCR | 一个极简的 OCR 工具



WeOCR, an easy offline tool for OCR. A simple PWA for OCR, based on Tesseract.

推荐语:WeOCR ,一个极简的 OCR 工具,无需安装,支持离线图片文字提取,它具有如下功能特征:

    易于使用:只需上传或直接粘贴,即可识别图片文字;
    聚焦隐私:无需网络访问,本地处理,专注隐私保护;
    可扩展引擎:目前支持 Tesseract,后序待扩展;
    多语言:支持中文,英语等多种语;

WeOCR 可以上传或拖拽图片,支持 jpg, png, bmp, tiff, webp, 等格式,基于 Tesseract (https://github.com/tesseract-ocr/tesseract,开源 OCR 引擎)所构建,识别速度很快, 对于识别结果,它可拷贝、美化。如果您手头没有趁手的 AI 工具,在电脑端识别图片文字,倒是可以考虑。值得一提的是,该项目在 Github 仓库开源:https://github.com/plantree/ocr-pwa ,如果您对其感兴趣,可移步前往从而了解更多。

https://ocr.plantree.me/

(开源的OCR工具-WeOCR

WeOCR是一款开源的在线文字识别 OCR 工具,支持 PWA,可在浏览器中离线使用,项目基于开源 OCR 引擎 Tesseract,VUE编写,遵守MIT开源协议。

功能特色:
    易于使用。PWA可以从浏览器轻松安装,并且可以离线使用。
    注重隐私。基于 WebAssembly ,在 Web 运行原生工具,同时避免与服务器的交互。
    可扩展的引擎。目前仅支持 Tesseract,之后可能支持 PaddlePaddle。)

-----------------------------

ocr-pwa

A simple PWA for OCR, based on Tesseract.

Preview

https://ocr.plantree.me/

Introduction

This is a simple tool for OCR, and it can be used without network connection.

There're already many OCR tools on the market, even if iPhone and WeChat could do that easily. Why do I still do it?

There are about two reasons:

  1. I want to do OCR easily on the desktop, but I do not want to install any applications.
  2. I hope the tool could be used offline.

Therefore, this project has four features you might be interested:

  1. Easy to use. This is a PWA and you can install it easily from the browser and use is offline.
  2. Focus on privacy. Thanks to the development of WebAssembly, we can introduce some awesome native tools to the web, meanwhile avoid interactions with servers.
  3. Extendable Engines. Currently supports Tesseract only, and may support PaddlePaddle afterwards.
  4. Support Multiple Languages. Potential support list could be found here: https://tesseract-ocr.github.io/tessdoc/Data-Files-in-different-versions.html.

Develop

This project is built with Vite, Vue3 and TypeScript.

# 1. git clone
git clone git@github.com:plantree/ocr-pwa.git

# 2. Install depencies
pnpm install

# 3. Run
npm run dev
from https://github.com/plantree/ocr-pwa 
----
相关帖子:TrWebOCR-开源的离线OCR程序 

 

No comments:

Post a Comment