How can I add Arabic as a recognition language to Lios?

Linux for blind general discussion blinux-list at redhat.com
Mon Dec 20 18:33:53 UTC 2021


Thanks Didier,

I will send you a PDF I just tried to OCR with Lios and got gibbirish. I 
will send to you on your diirect email.

Cheers,

Ibrahim

On 12/20/21 12:22 PM, Linux for blind general discussion wrote:
> Hi Ibrahim,
>
> That is is what I understood, but I would like to know if the same issue occurs
> with a PDF file, hence my requests that we both test with the same PDF file.
> Even more so as I do not have a scanner at hand right now to reproduce the test
> you did.
>
> Cheers,
> Didier
>
> Le 20/12/2021 à 18:12, Linux for blind general discussion a écrit :
>> Hi Didier,
>> I did not do the testing on a PDF file. I did the testing in direct scan. I put the page on a flatbed scanner and hit the scan and recognize menu item in Lios.
>> Cheers,
>> Ibrahim
>>
>> Sent from my iPhone
>>
>>> On Dec 20, 2021, at 12:04 PM, Linux for blind general discussion <blinux-list at redhat.com> wrote:
>>>
>>> Hi Ibrahim,
>>>
>>> Not sure where the issue is.
>>>
>>> To investigate, could you please send provide a link to a pdf in Arabic (or send
>>> me privately the pdf itself) that we could both try?
>>>
>>> Thanks
>>>
>>> Cheers,
>>> Didier
>>>
>>>> Le 20/12/2021 à 05:09, Linux for blind general discussion a écrit :
>>>> Thanks Didier,
>>>>
>>>> Unfortunately, I tried your suggestion and still got zero accuracy of OCR of
>>>> Arabic letters. I typed a Paragraph of Arabic and printed and then scanned, but
>>>> the outcome was gibbirish, all in latin characters. I typed a paragraph in
>>>> English and printed it out, when I scanned, the OCR accuracy was excellent.  So,
>>>> ther is some problem with the Arabic recognition.  It seems to me that although
>>>> Arabic is listed, the OCR engine is not actually trying to recognize Arabic.
>>>>
>>>> Cheers,
>>>>
>>>> Ibrahim
>>>>
>>>>> On 12/17/21 6:48 PM, Linux for blind general discussion wrote:
>>>>> Hi Ibrahim,
>>>>>
>>>>> You do not need to add anything special, the files
>>>>> /usr/share/tessdata/Arabic.traineddata being included in the package
>>>>> tesseract-data in Slint.
>>>>>
>>>>> Juts open lios, then in menu select Preferences then Preferences recognition and
>>>>> select:
>>>>> Engine: Tesseract
>>>>> Language: Arabic
>>>>>
>>>>> I don't have a scanner at hand but downloaded this file:
>>>>> https://fada.birzeit.edu/bitstream/20.500.11889/6910/1/mkhaldi%20Sahar%20Research.pdf
>>>>>
>>>>> then I opened it in Lios (menu File then Open).
>>>>>
>>>>> The file was recognized and the text properly extracted.
>>>>>
>>>>> Copying a paragraph of the extracted text and pasting it in translate.google.fr
>>>>> allowed me to read it in French <smile>
>>>>>
>>>>> Cheers,
>>>>>
>>>>> Didier
>>>>>
>>>>> Le 18/12/2021 à 00:10, Linux for blind general discussion a écrit :
>>>>>> Hi All,
>>>>>>
>>>>>> This question is primarily to Didier:
>>>>>>
>>>>>> How can I add Arabic dictionary to Lios so that I can use my scanner to scan
>>>>>> Arabic text? I assume I will also be able to run Arabic.pdf files through Lios
>>>>>> and as such I will have access to a lot of Arabic books available on the net.
>>>>>>
>>>>>> Cheers,
>>>>>>
>>>>>> Ibrahim
>>>
>>> _______________________________________________
>>> Blinux-list mailing list
>>> Blinux-list at redhat.com
>>> https://listman.redhat.com/mailman/listinfo/blinux-list
>> _______________________________________________
>> Blinux-list mailing list
>> Blinux-list at redhat.com
>> https://listman.redhat.com/mailman/listinfo/blinux-list
>
> _______________________________________________
> Blinux-list mailing list
> Blinux-list at redhat.com
> https://listman.redhat.com/mailman/listinfo/blinux-list




More information about the Blinux-list mailing list