Tesseract 5.0 OCR Engine Bringing Faster Performance With "Fast Floats"


  • Tesseract 5.0 OCR Engine Bringing Faster Performance With "Fast Floats"

    Phoronix: Tesseract 5.0 OCR Engine Bringing Faster Performance With "Fast Floats"

    Tesseract, the leading open-source optical character recognition (OCR) engine, which uses neural networks to convert images and scans of text into machine-readable text, is nearing its 5.0 release...

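    For context on where the "fast float" speedup comes from: LSTM inference time is dominated by large matrix-vector products, and halving the element size pays off twice, in memory bandwidth and in SIMD width. A rough sketch of the kind of hot loop involved (illustrative only, not Tesseract's actual code):

        #include <cstddef>

        // Not Tesseract's code -- just the shape of the loop that dominates
        // LSTM inference. With float instead of double, each weight fetch
        // moves half the bytes, and a 256-bit AVX2 register holds 8 values
        // instead of 4 (16 vs. 8 with AVX-512).
        float dot(const float* w, const float* x, std::size_t n) {
            float acc = 0.0f;
            for (std::size_t i = 0; i < n; ++i) {
                acc += w[i] * x[i];  // auto-vectorizes at -O3; -ffast-math lets the compiler reorder the sum
            }
            return acc;
        }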

  • #2
    Curious about AVX-512 optimizations: they make a dramatic difference in libdav1d with 10-bit content (Rocket Lake kills Zen 3 there). If that turns out to be true for Tesseract too, AVX-512's usefulness should probably be re-evaluated.



  • #3
    LOL, I'm finishing a week-long OCR run right now. It's not obvious whether fast float is enabled in the beta binaries, or how to enable it in a build, but maybe it's worth looking into...
    Last edited by elatllat; 16 August 2021, 02:42 PM.
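
    For anyone digging into the build question: in the tesseract 5 sources, the switch appears to be a compile-time typedef gated on a FAST_FLOAT define (those names are from my reading of the repo; verify against the actual headers). The pattern, roughly:

        // Sketch of the pattern as I read it in the tesseract 5 sources; the
        // exact macro/type names (FAST_FLOAT, TFloat) should be checked there.
        #ifdef FAST_FLOAT
        using TFloat = float;   // "fast float" build: 32-bit network math
        #else
        using TFloat = double;  // traditional build: 64-bit throughout
        #endif

        // The LSTM code is written against TFloat, so one define flips the
        // whole network between single and double precision at build time.
        TFloat dot_product(const TFloat* u, const TFloat* v, int n) {
            TFloat total = 0;
            for (int i = 0; i < n; ++i) {
                total += u[i] * v[i];
            }
            return total;
        }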



  • #4
    Traditionally the Tesseract OCR engine has relied upon doubles
    WTF??

    Did these clowns actually write their own deep learning framework? If so, they should ditch it and use one of the others, for probably an overnight order-of-magnitude speedup. Not exaggerating.
    Last edited by coder; 16 August 2021, 02:54 PM.



  • #5
    Originally posted by aufkrawall View Post
    Curious about AVX-512 optimizations: they make a dramatic difference in libdav1d with 10-bit content (Rocket Lake kills Zen 3 there).
    Look at other CPU-based deep learning benchmarks. OpenVINO is a good one, because both its AVX2 and AVX-512 paths are well-optimized. Michael has some other deep learning benchmarks in PTS that are pretty rubbish, in that they don't represent realistic performance on CPU.

    As for dav1d, that's another rubbish comparison, simply because the non-AVX-512 path for 10-bit wasn't comparably optimized. So it's not an apples-to-apples comparison.
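
    To make the lane-width point concrete, a minimal illustration (not dav1d or OpenVINO code) of why an equally well-tuned AVX-512 kernel issues half the instructions of an AVX2 one:

        #include <immintrin.h>

        // Per fused multiply-add instruction, AVX2 retires 8 float MACs while
        // AVX-512 retires 16. Whether that wins in practice also depends on
        // clock behavior, port counts, and how tuned each path actually is.

        // AVX2: 8 float lanes per 256-bit FMA (compile with -mavx2 -mfma).
        __m256 fma8(__m256 acc, __m256 a, __m256 b) {
            return _mm256_fmadd_ps(a, b, acc);  // acc += a * b, elementwise
        }

        #ifdef __AVX512F__
        // AVX-512F: 16 float lanes per 512-bit FMA (compile with -mavx512f).
        __m512 fma16(__m512 acc, __m512 a, __m512 b) {
            return _mm512_fmadd_ps(a, b, acc);  // acc += a * b, elementwise
        }
        #endif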



  • #6
    So more optimization of the non-AVX-512 path in libdav1d is only a matter of time, then? It's been a while since they started using AVX-512.



  • #7
    So why were doubles used instead of floats in the first place? Out of habit?
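
    Probably habit plus caution: double was the safe default before anyone profiled. For inference, float's roughly 7 significant decimal digits are plenty. A quick check with synthetic stand-in values (illustrative, not Tesseract's numbers) shows a float accumulation staying within about 1e-7 relative error of the double result, orders of magnitude below anything that would flip a character decision:

        #include <cmath>
        #include <cstdio>

        int main() {
            // Accumulate a dot product of a plausible hidden-layer width in
            // both precisions and compare. The values are synthetic stand-ins.
            const int n = 1024;
            double acc_d = 0.0;
            float acc_f = 0.0f;
            for (int i = 0; i < n; ++i) {
                double w = 1.0 / (i + 1);  // stand-in weight
                double x = 1.0 / (i + 2);  // stand-in activation
                acc_d += w * x;
                acc_f += static_cast<float>(w) * static_cast<float>(x);
            }
            // Expect a relative error on the order of 1e-7 -- float's
            // precision floor.
            std::printf("double: %.12f\nfloat:  %.12f\nrel err: %.2e\n",
                        acc_d, static_cast<double>(acc_f),
                        std::fabs((acc_f - acc_d) / acc_d));
            return 0;
        }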



  • #8
    Originally posted by coder View Post
    WTF??

    Did these clowns actually write their own deep learning framework? If so, they should ditch it and use one of the others, for probably an overnight order-of-magnitude speedup. Not exaggerating.
    I don't know whether theirs is any good or not, but you should know that Tesseract was started in the 1980s, making it waaay older than any of the other things you're thinking of.



  • #9
    Tesseract always gives me funny results when I OCR Japanese documents.
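
    For what it's worth, Japanese results often hinge on the language pack and the page-segmentation mode. A minimal sketch with the C++ API (assumes tesseract 5 and the jpn traineddata are installed, linking with -ltesseract -llept; the separate jpn_vert pack handles vertical text):

        #include <leptonica/allheaders.h>
        #include <tesseract/baseapi.h>
        #include <cstdio>

        int main() {
            tesseract::TessBaseAPI api;
            // "jpn" covers horizontal Japanese; vertical text wants "jpn_vert".
            if (api.Init(nullptr, "jpn") != 0) {
                std::fprintf(stderr, "could not find jpn traineddata\n");
                return 1;
            }
            // Japanese scans often need an explicit layout hint; this treats
            // the image as one uniform block of text.
            api.SetPageSegMode(tesseract::PSM_SINGLE_BLOCK);

            Pix* image = pixRead("scan.png");  // hypothetical input file
            if (!image) return 1;
            api.SetImage(image);

            char* text = api.GetUTF8Text();
            std::printf("%s", text);

            delete[] text;
            pixDestroy(&image);
            api.End();
            return 0;
        }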



  • #10
    Originally posted by aufkrawall View Post
    So more optimization of the non-AVX-512 path in libdav1d is only a matter of time, then? It's been a while since they started using AVX-512.
    Someone probably has to fund the work. I forget who, but someone paid them to do the AVX-512 optimization of the 10-bit path. I think it was like Netflix or someone like that -- not Intel.
