.doc, .xls, and .ppt files, and generates an OOXML version of each. In the old days, you’d use VBA to do this kind of thing. I’m behind on this kind of technology, but I assume there’s something on Windows that would make this tractable?xmlparser.rb, you can switch parsers, just as Nick Sieger has done for JRuby with JREXML. But I digress.) In the short term, we need to see if the REXML maintainers are responsive to Sam’s patch.&#xBabe;... or you could rewrite it in Ruby and It Would Be Much Better”. I shrieked “Get thee behind me foul tempter!” and have now jammed everything into 7-bit ASCII as it comes out of the XML parser, and of course all the problems have gone away. Actually, the code got simpler, lots of XML escaping/unescaping calls are no longer necessary. This is one of the nice things about XML I guess, it allows you to be a good internationalization citizen even when your software infrastructure isn’t. It still feels evil. Anyhow, the whole site’s been republished, let me know if anything’s busted. (By the way, if you’re reading this in my RSS feed and all the entries show up as new, switch to the Atom feed and that problem will go away, because Atom actually has unique IDs and datestamps that work.) [Updated: Tony Coates (interesting new blog there, BTW) reports that Opera 8.02 gets it backwards, which means that it’s one of the rare pieces of software that respects guids in RSS, but that it’s doing Atom 1.0 wrong.]< in the title caused all sorts of grief and breakage, both here at ongoing and downstream in the world of syndication and aggregation. I can fix my own problems, but it’s deeper downstream; long term, the answer is Atom. Herewith some thoughts on good programming practices and the larger problem. [Update: A couple of notes on the “href problem.”] ...attribute which has an attribute whose name is name. It makes it hard to talk about the XPaths without a lot of stuttering.genxScanUTF8() method, so it’s not entirely vapor. Upon consideration, I think it will be virtually no extra work to make it emit Canonical XML, ready to be signed, sealed and delivered (and Rich Salz said he would help) so why not? Major thanks to Anthony J. Starks for the name—I am not a member of Gen X myself, but I do share a city with Coupland, so there you go. Since ongoing doesn’t have comments, I’ll post a pointer to this item over in the xml-dev mailing list, which is a natural place to discuss it. It would be very surprising if this first-cut sketch didn’t contain some stupid errors, so go get ’em.xml-dev mailing list. Like many electronic communities, xml-dev suffers from a few tedious permathreads, from regular childish ranting, and from side-trips into the abstruse. But if you ask a hard technical question on XML there, you'll probably get an answer, almost immediately. The problem is that the mailing list is mismanaged, broken, unreliable, inaccessible, and really ought to find a new home with competent grownup minders ...xml-dev mailing list, the original XML-zealot conclave and home to most of the people in the world who worry seriously about XML in general; a very special and fortunately small shared obsession ...