this post was submitted on 14 Apr 2025
5 points (100.0% liked)

Machine Learning

1957 readers
5 users here now

founded 4 years ago
MODERATORS
 

cross-posted from: https://lemm.ee/post/61282397

Open sourcing this project I made in just a weekend, planning to continue this in my free time, with synthetic data gen and some more modifications, anyone is welcome to chip in, I'm not an expert in ML. The inference is live here using tensorflow.js. The model is just 1.92 Megabytes!

you are viewing a single comment's thread
view the rest of the comments
[–] DontNoodles@discuss.tchncs.de 1 points 3 weeks ago* (last edited 3 weeks ago)

I get all your points and I think they are the reason this has not been solved yet. But at times like this, i take inspiration from the story of first version of Captcha that, I think, Yahoo! created. The simplicity of using two words, one known and the other unknown to practically get all-printed-words-ever transcribed is nothing short of awe inspiring. If the Indian government were to put all words in regional languages as a part of Indian version of such Captcha just to book tickets on Indian railways then the entirety of regional language text could be transcribed before we know it, besides giving valuable training datasets for ML/DL models too.

Nonetheless, i wish you the very best in your endeavours.