Discovering Spatial Locality in WWW Access Patterns using Data Mining of Document Clusters in Server Logs
MetadataShow full item record
CitationBestavros, Azer. "Discovering Spatial Locality in WWW Access Patterns using Data Mining of Document Clusters in Server Logs", Technical Report BUCS-1997-016, Computer Science Department, Boston University, August 28, 1997. [Available from: http://hdl.handle.net/2144/3773]
In this paper, we introduce the notion of a "document cluster" in WWW space as a generalization of the notion of a "cache line" in linear memory address space. Through the analysis of Web server logs, we show evidence of the spatial locality of reference in WWW access patterns and present an implementation of an efficient data mining algorithm that discovers document clusters. We show preliminary simulation results that quantify the benefits of using document clusters for file allocation on server disks, as well as for purposes of prefetching into server cache/main memory.
Only the abstract for this technical report is available.