I'm not sure if members of this list would find this type of e-mail useful (please let me know if you do... or don't), but here goes:
A common problem in text encoding is locating the correct codes for "unusual letters". There are various utilities for doing this in windows, mac, and Linux. But here is a useful web-based utility.You can use it to look up character names and find their code point (though you do have to be fairly precise), and it will produce the correct number in hex and decimal formats. It will also tell you everything you ever wanted to know about characters required for encoding Estonian.
See also the Digital Medievalist Wiki entry for character encoding http://sql.uleth.ca/dmorgwiki/index.php/Fonts (to which I have just added information about this site).
-dan
Daniel Paul O'Donnell wrote:
I'm not sure if members of this list would find this type of e-mail useful (please let me know if you do... or don't), but here goes:
I find it useful.
A common problem in text encoding is locating the correct codes for "unusual letters". There are various utilities for doing this in windows, mac, and Linux. But here is a useful web-based utility.You can use it to look up character names and find their code point (though you do have to be fairly precise), and it will produce the correct number in hex and decimal formats. It will also tell you everything you ever wanted to know about characters required for encoding Estonian.
Well, ok, I actually I knew about this particular site. I've used that and of course there is the unicode site itself, especially the charts page. Also, most linux distributions contain a graphical character-map utitilty that is searchable.
On of the things out of unicode recently is their report:
http://www.unicode.org/reports/tr22/
on CharMapML = Character Mapping Markup Language.
Readers might also be interested in drafts of: TEI P5 Draft Chapter 4: Language and Character Sets: http://www.tei-c.org/P5/Guidelines/CH.html and TEI P5 Draft Chapter 25: Representation of non-standard characters and glyphs http://www.tei-c.org/P5/Guidelines/WD.html
Just thought I'd add that in to Dan's comment. -James