An RDF crawler
I wrote an RDF crawler (aka scutter) using Java and the Jena RDF toolkit that spiders the web gathering up semantic web data and storing it in any of Jena's backend stores (in-memory, Berkeley DB, mysql, etc).
http://www.hackdiary.com/archives/000030.html