for more info, see unicode standard, ch 4