this post was submitted on 29 Apr 2025

Free and Open Source Software


I found Adobe Acrobat was good at OCR, but I'm on Linux now.

top 3 comments
[–] Toes@ani.social 4 points 1 day ago (1 children)

I'm not super familiar with the subject but it'll probably be something based on Tesseract.

Maybe try gImageReader.

[–] e0qdk@reddthat.com 3 points 21 hours ago* (last edited 21 hours ago)

You can run tesseract -l jpn input.png - on the command line to print the text from input.png to the console, provided you've got the language files for Japanese installed. (My package manager also has language files for vertical text and a few others for scripts.) Alternatively, give an output filename (without extension) instead of - to write the output to a .txt file.

On Mint, I think I did sudo apt install tesseract-ocr tesseract-ocr-jpn to get it working for the simple case of horizontal text; it's been a while, though.
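
If you'd rather script this than type the command each time, here's a rough Python sketch that wraps the same call. The function names tesseract_args and ocr_to_text are made up for illustration, and it assumes the tesseract binary is on your PATH with the language pack you ask for already installed. It passes the arguments in the order from the man page synopsis (image, output base, then options); I haven't tested it against every Tesseract version.

```python
import shutil
import subprocess
from typing import List, Optional


def tesseract_args(image: str, lang: str, output_base: Optional[str] = None) -> List[str]:
    """Build the argument list for one tesseract call (hypothetical helper).

    output_base=None uses "-", which makes tesseract print to stdout.
    Otherwise tesseract writes the text to <output_base>.txt.
    """
    target = "-" if output_base is None else output_base
    return ["tesseract", image, target, "-l", lang]


def ocr_to_text(image: str, lang: str = "jpn") -> str:
    """Run tesseract on one image and return the recognized text.

    Assumes the tesseract CLI and the requested language data are installed.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract not found on PATH; install it first")
    result = subprocess.run(
        tesseract_args(image, lang),
        capture_output=True,  # collect stdout instead of printing it
        text=True,            # decode bytes to str
        check=True,           # raise CalledProcessError on a non-zero exit
    )
    return result.stdout
```

Calling ocr_to_text("input.png") should give you the same text the command above prints; pass a different lang (e.g. "eng") if you installed another language pack.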

[–] Successful_Try543 4 points 1 day ago* (last edited 1 day ago)

Tesseract, along with the desired language pack, should handle the OCR part; for a GUI, you can use e.g. lios or others.