What font glyphs map to if they are not part of Unicode

What font glyphs map to if they are not part of Unicode



I am trying to understand the anatomy of a font file, and wondering about how glyphs are mapped to keyboard characters.



As part of that, I am also wondering what you do when you have font glyphs that are not part of unicode, such as with FontAwesome icons. FontAwesome uses them in CSS like this:


.fa4-clock-o:before
content: "f017";



If I wanted to let's say type these icons out using the keyboard, not sure what I would need to do to make that possible.



Wondering if I would have to do one of these:



Wondering if one could explain how to accomplish this at a medium level of detail (that is, I don't necessarily need to know the implementation specific details of how to do it yet, unless it's straightforward).






Try just going to fontawesome.com and clicking on the icons link I guess then. Thanks for letting me know.

– Lance Pollard
Sep 18 '18 at 19:04





2 Answers
2



A sane icon font offers GSUB ligature substitution, so that if your text contains a sequence of codepoints (like those associated with c + a + r) the font does a remapping to an internal glyph (like the car icon) so while typing, you see this:


c + a + r


c
co
com
comp
compu
comput
compute
💻



You typed "computer", once the font sees that last r it triggers the GSUB rule for c + o + m + p + u + t + e + r -> internal_id_24663 and it shapes the text with that replacement instead.


r


c + o + m + p + u + t + e + r -> internal_id_24663



Of course, if the internal glyph id is exposed through a CMAP table, you can also just access the glyph directly, by just specifying the USHORT that the font will be given to shape. In your example, you're directly specifying your Fontawesome icon by writing it outs unicode codepoint using a hex value rather than "a letter": the font uses the (by now defacto standard) platform-agnostic Unicode mapping, and you've asked for the glyph associated with code point 0xF017, which is in the "Private Use Area" block (a range of code points from E0x000` to 0xF8FF that don't have prescribed labels for each point: vendors can put whatever they need in them, and are not guaranteed to resolve to the same glyph between versions).



I've written about font internals, including these parts, over on http://pomax.github.io/CFF-glyphlet-fonts






This is very helpful, thank you. @AndreyTyukin true tho lol, not sure if fontawesome actually works with typing.

– Lance Pollard
Sep 18 '18 at 19:27






If the internal glyph id is exposed through a cmap you can access it by just using its number, provided your font's been told which character mapping it's supposed to use.

– Mike 'Pomax' Kamermans
Sep 18 '18 at 19:28


cmap






I filed an issue to add GSUB support to Fontawesome back in 2012. Five years later I closed it due to inactivity. I suspect FA still does not support GSUB, unlike fonts such as SymbolSet

– Mike 'Pomax' Kamermans
Sep 18 '18 at 19:39




Glyphs are not "mapped to keyboard characters". It's rather the other way around, and it happens in multiple steps:



Then the component of your operating system that is responsible for the keyboard layout transforms the scancodes into codepoints of the characters. For example, on my current operating system (Linux), the mapping from scancodes to code points is defined in a configuration file somewhere in /usr/share/X11/xkb/symbols. Here is an excerpt from the configuration file:


/usr/share/X11/xkb/symbols


key <AD01> [ q, Q, adiaeresis, Adiaeresis ] ;
key <AD02> [ w, W, aring, Aring ] ;
key <AD03> [ e, E, eacute, Eacute ] ;
key <AD04> [ r, R, registered, registered ] ;
key <AD05> [ t, T, thorn, THORN ] ;
key <AD06> [ y, Y, udiaeresis, Udiaeresis ] ;
key <AD07> [ u, U, uacute, Uacute ] ;
key <AD08> [ i, I, iacute, Iacute ] ;
key <AD09> [ o, O, oacute, Oacute ] ;
key <AD10> [ p, P, odiaeresis, Odiaeresis ] ;



For example, it says that the scancode <AD04> is mapped to the letter "r".


<AD04>



The symbols that you type on your keyboard using your layout somehow end up in the memory of some server, and are then served as a part of an HTML-document.



The CSS that accompanies the HTML document specifies that a particular font (e.g. Font Awesome) is to be used.



This particular font specifies that, for example, the unicode code point U+F420 should be rendered as the logo of Angular, this is why you see a hexagonal glyph with an A in your browser.


A



In this whole pathway, there is no fundamental difference between the Angular logo and any other character. The minor differences are:



If you wanted to type text with such glyphs anyway, you would have to:



Thanks for contributing an answer to Stack Overflow!



But avoid



To learn more, see our tips on writing great answers.



Required, but never shown



Required, but never shown




By clicking “Post Your Answer”, you agree to our terms of service, privacy policy and cookie policy

Popular posts from this blog

𛂒𛀶,𛀽𛀑𛂀𛃧𛂓𛀙𛃆𛃑𛃷𛂟𛁡𛀢𛀟𛁤𛂽𛁕𛁪𛂟𛂯,𛁞𛂧𛀴𛁄𛁠𛁼𛂿𛀤 𛂘,𛁺𛂾𛃭𛃭𛃵𛀺,𛂣𛃍𛂖𛃶 𛀸𛃀𛂖𛁶𛁏𛁚 𛂢𛂞 𛁰𛂆𛀔,𛁸𛀽𛁓𛃋𛂇𛃧𛀧𛃣𛂐𛃇,𛂂𛃻𛃲𛁬𛃞𛀧𛃃𛀅 𛂭𛁠𛁡𛃇𛀷𛃓𛁥,𛁙𛁘𛁞𛃸𛁸𛃣𛁜,𛂛,𛃿,𛁯𛂘𛂌𛃛𛁱𛃌𛂈𛂇 𛁊𛃲,𛀕𛃴𛀜 𛀶𛂆𛀶𛃟𛂉𛀣,𛂐𛁞𛁾 𛁷𛂑𛁳𛂯𛀬𛃅,𛃶𛁼

Crossroads (UK TV series)

ữḛḳṊẴ ẋ,Ẩṙ,ỹḛẪẠứụỿṞṦ,Ṉẍừ,ứ Ị,Ḵ,ṏ ṇỪḎḰṰọửḊ ṾḨḮữẑỶṑỗḮṣṉẃ Ữẩụ,ṓ,ḹẕḪḫỞṿḭ ỒṱṨẁṋṜ ḅẈ ṉ ứṀḱṑỒḵ,ḏ,ḊḖỹẊ Ẻḷổ,ṥ ẔḲẪụḣể Ṱ ḭỏựẶ Ồ Ṩ,ẂḿṡḾồ ỗṗṡịṞẤḵṽẃ ṸḒẄẘ,ủẞẵṦṟầṓế