this post was submitted on 29 Apr 2025
10 points (100.0% liked)

Free and Open Source Software

18632 readers
46 users here now

If it's free and open source and it's also software, it can be discussed here. Subcommunity of Technology.


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS
 

I found Adobe acrobat was good at ocr but I’m on Linux now.

you are viewing a single comment's thread
view the rest of the comments
[–] Toes@ani.social 4 points 3 days ago (1 children)

I'm not super familiar with the subject but it'll probably be something based on Tesseract.

Maybe try gImageReader.

[–] e0qdk@reddthat.com 3 points 3 days ago* (last edited 3 days ago)

You can use tesseract -l jpn input.png - on the command line to have it print out the text from input.png into the console if you've got the language files for Japanese installed. (There's also language files for vertical text and a few others for script in my package manager.) Alternatively give the filename (w/o extension) instead of - to write the output into a .txt file.

On Mint, I think I did sudo apt install tesseract-ocr tesseract-ocr-jpn to get it working for the simple case of horizontal text; been a while though.