How can I add Arabic as a recognition language to Lios?

Linux for blind general discussion blinux-list at redhat.com
Mon Dec 20 17:12:47 UTC 2021


Hi Didier,
I did not do the testing on a PDF file. I did the testing in direct scan. I put the page on a flatbed scanner and hit the scan and recognize menu item in Lios.
Cheers,
Ibrahim

Sent from my iPhone

> On Dec 20, 2021, at 12:04 PM, Linux for blind general discussion <blinux-list at redhat.com> wrote:
> 
> Hi Ibrahim,
> 
> Not sure where the issue is.
> 
> To investigate, could you please send provide a link to a pdf in Arabic (or send
> me privately the pdf itself) that we could both try?
> 
> Thanks
> 
> Cheers,
> Didier
> 
>> Le 20/12/2021 à 05:09, Linux for blind general discussion a écrit :
>> Thanks Didier,
>> 
>> Unfortunately, I tried your suggestion and still got zero accuracy of OCR of
>> Arabic letters. I typed a Paragraph of Arabic and printed and then scanned, but
>> the outcome was gibbirish, all in latin characters. I typed a paragraph in
>> English and printed it out, when I scanned, the OCR accuracy was excellent.  So,
>> ther is some problem with the Arabic recognition.  It seems to me that although
>> Arabic is listed, the OCR engine is not actually trying to recognize Arabic.
>> 
>> Cheers,
>> 
>> Ibrahim
>> 
>>> On 12/17/21 6:48 PM, Linux for blind general discussion wrote:
>>> Hi Ibrahim,
>>> 
>>> You do not need to add anything special, the files
>>> /usr/share/tessdata/Arabic.traineddata being included in the package
>>> tesseract-data in Slint.
>>> 
>>> Juts open lios, then in menu select Preferences then Preferences recognition and
>>> select:
>>> Engine: Tesseract
>>> Language: Arabic
>>> 
>>> I don't have a scanner at hand but downloaded this file:
>>> https://fada.birzeit.edu/bitstream/20.500.11889/6910/1/mkhaldi%20Sahar%20Research.pdf
>>> 
>>> then I opened it in Lios (menu File then Open).
>>> 
>>> The file was recognized and the text properly extracted.
>>> 
>>> Copying a paragraph of the extracted text and pasting it in translate.google.fr
>>> allowed me to read it in French <smile>
>>> 
>>> Cheers,
>>> 
>>> Didier
>>> 
>>> Le 18/12/2021 à 00:10, Linux for blind general discussion a écrit :
>>>> Hi All,
>>>> 
>>>> This question is primarily to Didier:
>>>> 
>>>> How can I add Arabic dictionary to Lios so that I can use my scanner to scan
>>>> Arabic text? I assume I will also be able to run Arabic.pdf files through Lios
>>>> and as such I will have access to a lot of Arabic books available on the net.
>>>> 
>>>> Cheers,
>>>> 
>>>> Ibrahim
> 
> 
> _______________________________________________
> Blinux-list mailing list
> Blinux-list at redhat.com
> https://listman.redhat.com/mailman/listinfo/blinux-list



More information about the Blinux-list mailing list