Sunday, March 6, 2011

How to convert ANSEL text to UTF-8

How can I convert text encoded in ANSEL to UTF-8 in C#?

From Stack Overflow
  • This is a non-trivial conversion, because neither Windows nor the .NET Framework has an ANSEL code page. See here for the travails of another person attempting this conversion.

  • Joshperry is correct. Eichmann's site has basically the ONLY documentation around that attempts to explain ANSEL encoding. Unfortunately there is no program code there, so you'll have to code it yourself.

    There is another code table for ANSEL (dated Dec 2007; I didn't know anyone was still interested) at: http://lcweb2.loc.gov/diglib/codetables/45.html. It is worth comparing Eichmann's work against it, since he did admit there were a few codes he was unsure of.

    If you are interested from a genealogy point of view, the GEDCOM standard release 5.5 has an abbreviated ANSEL character set and some encodings in its Appendix D.

    And if you got here because you are interested or have some involvement in genealogy programming, please check in at: http://stackoverflow.com/questions/338664/genealogy-anyone

  • There is an AnselEncoding class (deriving from the standard Encoding class) available as part of Gedcom.NET at http://sourceforge.net/projects/gedcom-net/ that also includes support for some of the LDS extensions.
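
For anyone who does end up coding it by hand, as the answers above suggest, here is a minimal sketch of the idea. It is not the Gedcom.NET AnselEncoding class; AnselSketch, Decode and ToUtf8 are hypothetical names made up for this example, and the two lookup tables cover only a handful of ANSEL codes, so they should be completed and checked against the Library of Congress table before real use. The main point it illustrates is that ANSEL puts combining diacritics before the base letter, while Unicode puts them after it.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

// Hypothetical helper for illustration; not part of .NET or Gedcom.NET.
public static class AnselSketch
{
    // Partial table of ANSEL spacing characters (assumption: verify and
    // extend against the LoC code table before relying on it).
    static readonly Dictionary<byte, char> Spacing = new Dictionary<byte, char>
    {
        { 0xA1, '\u0141' }, // L with stroke
        { 0xA2, '\u00D8' }, // O with stroke
        { 0xA5, '\u00C6' }, // AE
        { 0xB1, '\u0142' }, // l with stroke
        { 0xB2, '\u00F8' }, // o with stroke
        { 0xB5, '\u00E6' }, // ae
    };

    // Partial table of ANSEL combining diacritics (same caveat as above).
    static readonly Dictionary<byte, char> Combining = new Dictionary<byte, char>
    {
        { 0xE1, '\u0300' }, // grave
        { 0xE2, '\u0301' }, // acute
        { 0xE3, '\u0302' }, // circumflex
        { 0xE4, '\u0303' }, // tilde
        { 0xE8, '\u0308' }, // umlaut / diaeresis
        { 0xEA, '\u030A' }, // ring above
        { 0xF0, '\u0327' }, // cedilla
    };

    public static string Decode(byte[] ansel)
    {
        var result = new StringBuilder(ansel.Length);
        var pending = new StringBuilder(); // diacritics waiting for their base letter

        foreach (byte b in ansel)
        {
            char mark;
            if (Combining.TryGetValue(b, out mark))
            {
                // ANSEL: the diacritic comes BEFORE the letter it modifies.
                pending.Append(mark);
                continue;
            }

            char c;
            if (b < 0x80)
                c = (char)b;            // ASCII range maps straight through
            else if (!Spacing.TryGetValue(b, out c))
                c = '\uFFFD';           // not in this partial table

            // Unicode: the base letter comes first, then its combining marks.
            result.Append(c).Append(pending.ToString());
            pending.Clear();
        }

        result.Append(pending.ToString()); // dangling diacritics at end of input
        // Compose to precomposed characters where Unicode has them.
        return result.ToString().Normalize(NormalizationForm.FormC);
    }

    public static byte[] ToUtf8(byte[] ansel)
    {
        return Encoding.UTF8.GetBytes(Decode(ansel));
    }
}
```

For example, the ANSEL bytes 4A 6F 73 E2 65 ("Jos", acute accent, "e") decode to the string "José", and ToUtf8 then returns its UTF-8 bytes. Bytes missing from the partial tables come out as U+FFFD rather than being silently dropped, which makes gaps in the table easy to spot.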
