Our extractor tool was not calling setlocale(), thus it only produced output in the C locale, ie ASCII. Oops.