When I first registered XMLSpy (3.5) it asked me whether all HTML files would be XHTML compliant. That was about 5 years ago, so it's taken some time, but I can now confirm that, slashdot moans aside, msn.com is at least now valid XML.
I predict minor upturn in screen-scraping as an integration technique.
[/. isn't btw]