Is there a (near) universal standard for sorting titles beginning with contractions?

The tags for music can be unicode. Itunes has supported this since about version 2. Most other players like winamp and windows media player also support unicode tags.

Even so, I don’t think most commercial software really uses STRICT ASCII/Unicode ordering, in other words, the programmer isn’t likely to just use

if strcmp(a,b) < 0 then

if strcmp(a,b) = 0 then

if strcmp(a,b) > 0 then
They’ll likely implement their own comparator. One likely change is to ignore case, otherwise all of your titles starting with a capital letter will come before all titles with lowercase. Hell, even in iTunes you can see they clearly use a custom comparitor because it ignores parts of strings (like leading spaces and words like The). When you’ve implemented a custom comparator.

ASCII/Unicode ordering isn’t wholly irrelevant, for “special” characters like apostrophes they can be used as a tiebreaker, but the chances of serious commercial software just using what amounts to strcmp (and thus organizing everything in STRICT character code ordering) are pretty small.

I’d say that so it is. I got into it when I went from cataloguer to programmer in charge of editing various sorts of catalogue outputs and inherited a system from my predecessor.

One thing that makes it impossible to implement a universal standard is that different languages use different order between the same characters and a letter that is a character in its own right in one language is not in another. Take, for instance, a, á, à, ã, å and ä. Should they all be treated as a? In Swedish the answer is no, as a, å and ä are different letters. And if you should put the others in a certain order, which order would that be?

The we come to digits. Should they go before or after letters? Should they be sorted as separate characters or should they be sorted according to their number values: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 … 20, 21 etc?

Mac, Mc, M’ has already been discussed. We all treat them as mac. On top of that we have many names ending with -son, or, in most cases, -sson. They are all uniformed to -sson.

Should space be treated as a word separator or not? Some sorting standards do, some don’t. We do and treat “-” as space as well. All other non letter or digit characters are omitted.

Also upper case comes before lower case. Finder and Windows Explorer are programmed to ignore case, but when listing files in bash, Aardvark will come before Zebra, but Zebra will come before aardvark.

Did you at least change the color of the cover of the manual?