Once of the projects we've been persuing over the last year is the "Almost Universal Microformats Parser", a Python library that does a pretty good job of breaking apart microformats. You can run the AUMFP against any webpage here or download (currently an old version of) the source here.
We've made quite a few modifications in the last two weeks and we're getting ready to release a new version of the source and also a few tools based on this project. We'll wait till we have more documentation and testing in place before we do the release though. We do the include-pattern thingie now and the interface makes a lot more sense.
As part of the update, we've extended the parser to handle hResume and tested against the samples pointed to on the Wiki. We also tried to identify places where documents don't conform to the proposed standard and document them as quirks (in general, we write our software to fail-as-last-resort). Here's the results of our testing:
-
Kenn Wilson has marked up his resume: [parsed ok
- quirks: broken skill rel-tag]. - Ralph Brandi has marked up his resume: [parsed ok - quirks: missing "contact"]
- Pat Ramsey has his resume: [parsed ok - quirks: missing "contact"].
- Wim Le Page has also marked up his curriculum vitae: [parsed ok - quirks: missing "contact"].
- Jonathan Arkell has posted an hResume: [parsed ok].
- Steve Ganz - hResume 0.1: [parsed ok]: [parsed ok - quirks: missing "contact"].
-
Dave Cardwell has marked up his curriculum vitae: [parsed ok
broken - uses "resume" root element]. - Edward O'Connor's resume: [broken - no hResume content!].
- Lindsey Simon has his resume: [parsed ok - quirks: missing "contact"].
- Ben Ward has published his CV: [parsed ok - quirks: missing "contact"].
- Kelley Chambers has published her resume: [broken - no "hresume" root element].
Note that the "contact" quirk may be a misinterpretation on my part.

