1 min readfrom Machine Learning

Detecting mirrored selfie images: OCR the best way? [D]

I'm trying to catch backwards "selfie" images before passing them to our VLM text reader and/or face embedding extraction. Since models like Qwen and Florence are trained on flipped data, they are mostly blind to backwards text and prompting them just seems to be fighting against their base training (i'm assuming they used lots of augmented flipped training data). My best idea right now is to run EasyOCR on the text crops and see if the normal or flipped version gets a higher read score. Is this OCR score trick really the best way to handle this, or is there a smart, small model approach I'm missing?

submitted by /u/dangerousdotnet
[link] [comments]

Want to read more?

Check out the full article on the original site

View original article

Tagged with

#big data management in spreadsheets
#generative AI for data analysis
#conversational data analysis
#Excel alternatives for data analysis
#real-time data collaboration
#intelligent data visualization
#data visualization tools
#enterprise data management
#big data performance
#data analysis tools
#data cleaning solutions
#rows.com
#OCR
#selfie
#backwards text
#flipped data
#EasyOCR
#model approach
#text reader
#VLM