Parasite: Mining Structural Information on the Web
Used these assumptions to propose applications for finding:
- moved pages
- related pages
- people
Comments
- Brute strength approach
- This and preceding paper (on Ahoy!) show importance of directory naming conventions (directory names provide metadata - what can we guess from the URL www.cs.acme.edu/staff/jsmith)