Discovering Spatial Locality in WWW Access Patterns using Data Mining of Document Clusters in Server Logs

Date
1997-09-10
Authors
Bestavros, Azer
Citation
Bestavros, Azer. "Discovering Spatial Locality in WWW Access Patterns using Data Mining of Document Clusters in Server Logs", Technical Report BUCS-1997-016, Computer Science Department, Boston University, August 28, 1997. [Available from: http://hdl.handle.net/2144/3773]
Abstract
In this paper, we introduce the notion of a "document cluster" in WWW space as a generalization of the notion of a "cache line" in linear memory address space. Through the analysis of Web server logs, we show evidence of the spatial locality of reference in WWW access patterns and present an implementation of an efficient data mining algorithm that discovers document clusters. We show preliminary simulation results that quantify the benefits of using document clusters for file allocation on server disks, as well as for purposes of prefetching into server cache/main memory.
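To make the idea of mining document clusters from a server log concrete, here is a minimal Python sketch. It is not the report's algorithm, which this record does not describe. It assumes a simple co-access heuristic: two documents are linked when one is requested by the same client within a short time window after the other, often enough relative to how often the first is requested. Clusters are then the connected components of those links. The function name `mine_clusters`, the `window` and `threshold` parameters, and the `(client, timestamp, document)` log format are all hypothetical.

```python
from collections import defaultdict


def mine_clusters(log, window=5.0, threshold=0.5):
    """Group documents that tend to be requested together.

    log: iterable of (client, timestamp_seconds, document) tuples.
    window, threshold: illustrative parameters, not taken from the report.
    """
    by_client = defaultdict(list)
    for client, ts, doc in log:
        by_client[client].append((ts, doc))

    doc_count = defaultdict(int)   # how often each document was requested
    pair_count = defaultdict(int)  # how often j followed i within the window

    for accesses in by_client.values():
        accesses.sort()
        for idx, (ts_i, doc_i) in enumerate(accesses):
            doc_count[doc_i] += 1
            seen = set()  # count each follower once per request of doc_i
            for ts_j, doc_j in accesses[idx + 1:]:
                if ts_j - ts_i > window:
                    break
                if doc_j != doc_i and doc_j not in seen:
                    seen.add(doc_j)
                    pair_count[(doc_i, doc_j)] += 1

    # Union-find over documents: merge i and j when the estimated
    # probability that j follows i meets the threshold.
    parent = {d: d for d in doc_count}

    def find(d):
        while parent[d] != d:
            parent[d] = parent[parent[d]]
            d = parent[d]
        return d

    for (i, j), n in pair_count.items():
        if n / doc_count[i] >= threshold:
            parent[find(i)] = find(j)

    clusters = defaultdict(set)
    for d in doc_count:
        clusters[find(d)].add(d)
    return sorted(sorted(c) for c in clusters.values())
```

Under these assumptions, a page and the image it embeds would typically end up in the same cluster, which a server could then place contiguously on disk or prefetch together.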
Description
Only the abstract for this technical report is available.