Planet PDF Forum



Issue with non-unicode encoded font - hebrew pdf

Author: ralphza (New Member, joined 21 Jul 2017)
Posted: 21 Jul 2017 at 1:51pm
Hi

I'm having an issue copy/pasting from a PDF, or converting it to DOC or TXT.

I have uploaded the PDF file to ge.tt (http://ge.tt/3WYLgrl2) and to uploadfiles.io (https://ufile.io/pbnr3).

The file is 2MB in size.

Courier New supports Hebrew characters, e.g. \u05D0 (Hebrew letter aleph), so I can use the Courier New font in Notepad to show Hebrew.
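
To be concrete about what I would expect to see after a correct paste, here is a minimal C# sketch (my own illustration, it does not touch the PDF) that prints the code point and the UTF-8 bytes of aleph. The class name `AlephCheck` and the helper `Utf8Hex` are just names I made up.

```csharp
using System;
using System.Text;

static class AlephCheck
{
    // Returns the UTF-8 bytes of a string as dash-separated hex.
    public static string Utf8Hex(string s)
    {
        return BitConverter.ToString(Encoding.UTF8.GetBytes(s));
    }

    static void Main()
    {
        string aleph = "\u05D0"; // Hebrew letter aleph
        Console.WriteLine("U+{0:X4}", (int)aleph[0]); // prints U+05D0
        Console.WriteLine(Utf8Hex(aleph));            // prints D7-90
    }
}
```

If the text pasted from the PDF were real Unicode Hebrew, its characters would fall in the U+05D0 to U+05EA range like this one.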

If I copy/paste the Hebrew from that PDF into Notepad, I get strange, non-Hebrew characters instead.

I am familiar with C#.

I wonder if I can somehow open the PDF in C#, determine which font and which encoding are used, and change the encoding to UTF-8.
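
One thing I could try on the pasted text is sketched below. It assumes, and I have not confirmed this for my PDF, that the font stores Hebrew at the Windows-1255 byte positions 0xE0 to 0xFA and that those bytes come out as Latin-1 characters when pasted (so aleph would show up as `à`). `HebrewRemap` and `Remap` are hypothetical names. The sketch only fixes an already pasted string; it does not read or rewrite the PDF, and it does not handle letters that come out in reversed (visual) order.

```csharp
using System;
using System.Text;

static class HebrewRemap
{
    // Assumption: characters U+00E0..U+00FA are really Windows-1255
    // bytes 0xE0..0xFA, i.e. Hebrew aleph..tav (U+05D0..U+05EA).
    // Everything else is passed through unchanged.
    public static string Remap(string pasted)
    {
        var sb = new StringBuilder(pasted.Length);
        foreach (char c in pasted)
        {
            if (c >= '\u00E0' && c <= '\u00FA')
                sb.Append((char)(0x05D0 + (c - 0x00E0)));
            else
                sb.Append(c);
        }
        return sb.ToString();
    }

    static void Main()
    {
        // "\u00E0\u00E1\u00E2" is what aleph, bet, gimel would look like
        // under the assumption above.
        string fixedText = Remap("\u00E0\u00E1\u00E2");
        foreach (char c in fixedText)
            Console.Write("U+{0:X4} ", (int)c); // prints U+05D0 U+05D1 U+05D2
        Console.WriteLine();
    }
}
```

If the strange characters turn out to be something other than `à` to `ú`, the mapping would need a different table, which is part of what I am asking about.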

I don't just want the text in Notepad, though. I want a PDF file with the same or similar formatting as the original, but with a font/encoding whose text I can copy/paste into Notepad.

Does anybody here have any ideas about how to go about that?

Thanks
