Planet PDF Forum Planet PDF Forum
  New Posts New Posts RSS Feed - Unicode mapping to Symbol font?
  FAQ FAQ  Forum Search   Register Register  Login Login

Hi, welcome to the Foxit Planet PDF Forum. If you have PDF or Adobe Acrobat questions then the right place to ask them is here, in this forum.

Unicode mapping to Symbol font?

 Post Reply Post Reply
Author
eekboom View Drop Down
New Member
New Member


Joined: 01 Oct 2012
Location: Hamburg,Germany
Points: 6
Post Options Post Options   Quote eekboom Quote  Post ReplyReply Direct Link To This Post Topic: Unicode mapping to Symbol font?
    Posted: 01 Oct 2012 at 8:30pm
The unicode standard explicitly mentions Zapf Dingbats and includes code points on mostly continuous order.

However for the standard PDF "Symbol" font: Is there a mapping from unicode code points?
Has anyone bothered to find the best matching unicode code for each glyph?
Back to Top
aandi View Drop Down
Senior Member
Senior Member


Joined: 07 Jul 2011
Points: 18358
Post Options Post Options   Quote aandi Quote  Post ReplyReply Direct Link To This Post Posted: 02 Oct 2012 at 12:02pm
I'd say that for a symbol font (of which Symbol is surely an example along with Zapf Dingbats) it isn't appropriate for text extraction to try to convert to Unicode.  My expectation at least would be that what is copied is the font's code points, which can be pasted into an application, then have the font applied to it. Now we have back the original characters.
 
If it were converted to Unicode, pasted, then had the font applied I don't think the results would be what were expected.
 
Of course, there may be specialist applications where this isn't an applicable problem or strategy.
Back to Top
aandi View Drop Down
Senior Member
Senior Member


Joined: 07 Jul 2011
Points: 18358
Post Options Post Options   Quote aandi Quote  Post ReplyReply Direct Link To This Post Posted: 02 Oct 2012 at 12:37pm
That said, Acrobat doesn't seem to do what I said; perhaps it can't. What it does isn't helpful though. It looks as though it copies the same string for the same codepoints in Symbol or Zapf Dingbats, but the string isn't useful as codepoints, Unicode, or anything else.
Back to Top
eekboom View Drop Down
New Member
New Member


Joined: 01 Oct 2012
Location: Hamburg,Germany
Points: 6
Post Options Post Options   Quote eekboom Quote  Post ReplyReply Direct Link To This Post Posted: 03 Oct 2012 at 9:47pm
Thanks for your answers!

Sorry, should have described my use case.
It is quite the opposite:
I want to create a PDF in Java and map as much characters as possible from my Java Unicode Strings to glyphs without needing to embed a font (so I intend to switch from the selected latin font like Courier to Symbol or Zapf Dingbats and back).

But of course your answer already has the simple obvious solution: Just copy the text from the PDF spec's encoding tables in appendix D using Adobe Reader and paste it into a source code file with unicode encoding (in my unicode capable Java IDE).
That works very well for me for almost all characters. I guess for some characters there simply don't exist any unicode code points (e.g. the top/middle/lower parts of braces like bracelefttp).

If it does not work for you I guess it is more a problem of the client application where you try to paste the text to. If I view the clipboard (using Peter Buettner's ClipView) I see that Adobe Reader on Windows copies the text in these clipboard formats simultaneously (CF_TEXT, CF_LOCALE, CF_OEMTEXT, CF_UNICODETEXT, Rich Text Format).

Back to Top
eekboom View Drop Down
New Member
New Member


Joined: 01 Oct 2012
Location: Hamburg,Germany
Points: 6
Post Options Post Options   Quote eekboom Quote  Post ReplyReply Direct Link To This Post Posted: 03 Oct 2012 at 11:25pm
By chance I found this:
   http://unicode.org/Public/MAPPINGS/VENDORS/ADOBE/
where the ReadMe.txt mentions
   http://www.adobe.com/devnet/opentype/archives/glyph.html
which contains links to the mapping from Zapf Dingbats to unicode:
   http://www.adobe.com/devnet-archive/opentype/archives/zapfdingbats.txt
and from glyph names used in Symbol to unicode:
   http://www.adobe.com/devnet-archive/opentype/archives/glyphlist.txt

Back to Top
 Post Reply Post Reply
  Share Topic   

Forum Jump Forum Permissions View Drop Down

Forum Software by Web Wiz Forums® version 11.10
Copyright ©2001-2017 Web Wiz Ltd.

This page was generated in 0.031 seconds.