Sat Jan 7 11:32:45 EST 2006 rsc Allprint doesn't handle unicode input. You'll need to fix a lot more than the prototype to make it work. Lex in general doesn't work with UTF: the regular expression machinery it builds doesn't know about UTF. I don't understand the debug message fix -- what does negative input() return values mean? I looked at the definition of input() that lex uses and I don't see it returning negative numbers.