[PDF] Balancing volume, quality and freshness in web crawling -
R Baeza-Yates, C Castillo - Soft Computing Systems-Design, Management and Applications, 2002 - ciw.cl
... Freshness can be estimated for most Web servers 1 ... 1 score=0.7 freshness=0.9 priority =
0.07 ... h 2 ('235 path/file.html') SITE id = 235; DOC id = 9421 INPUT ...
PCT NO: PCT/EP98/07664 371 Date: June 1, 2000 102 (e) Date: June 1, 2000 PCT PUB. NO.: WO99/27918 …
O Guide, C Education, T Courses, F Register, P … - pharmcast.com
... 1 0.09 60' 1.5 0.08 90' 3 0.08 120' 9 0.07 150 8 0.07 180 7.5 0.07 ... more about this
patent, please go directly to the US Patent and Trademark Office Web site to ...
-
[PS] Caching Strategies for Data-Intensive Web Sites -
K Yagoub, D Florescu, V Issarny, P Valduriez - The VLDB Journal, 2000 - www-rocq.inria.fr
... limitation is mainly due to the dynamic nature of many HTML doc- uments, which ... of
aairs even worse, rapidly increasing the percentage of dynamic Web documents. ...
-
Web server workload characterization: the search for invariants -
MF Arlitt, CL Williamson - ACM SIGMETRICS Performance Evaluation Review, 1996 - portal.acm.org
... then the response includes the requested doc- ument. If the request was unsuccessful,
a reason for the failure is returned to the client [15]. Once the Web ...
An investigation of documents from the World Wide Web -
A Woodruff, PM Aoki, E Brewer, P Gauthier, LA Rowe - Computer Networks and ISDN Systems, 1996 - Elsevier
... Web. Our data set, collected by the Inktomi 9 Web crawler, currently
comprises over 2.6 million 10 HTML doc- uments. We present ...
On Computing the Canonical Features of Software Systems -
J Kothari, T Denton, S Mancoridis, A Shokoufandeh - Proceedings of the 13th Working Conference on Reverse … - doi.ieeecomputersociety.org
... email-doc 0.07 0.22 0.3 0.2 0.12 0.14 1 0.38 0.37 0.08 0.34 ... Feature Name 1
Url-open-doc 2 Save 3 Paste 4 ... The Firefox suite includes a web-browser based on the ...
[PDF] Removal policies in network caches for world-wide web documents -
S Williams, M Abrams, CR Strandridge, G Abdulla, E … - Computer Communication Review, 1996 - cs.kent.edu
... than half the size of the incoming doc- ument; if ... dramat- ically reduce the load
on popular Web servers. ... Audio 0.09 3.15 0.07 1.47 0.21 2.93 2.57 87.78 0.25 ...
-
Reducing Program Comprehension Effort in Evolving Software by Recognizing Feature Implementation … -
J Kothari, T Denton, A Shokoufandeh, S Mancoridis - Proceedings of the 15th IEEE International Conference on …, 2007 - doi.ieeecomputersociety.org
... email-doc 0.07 0.22 0.3 0.2 0.12 0.14 1 0.38 0.37 ... listed (eg, Startup, File-Open,
URL-Open-Doc) against all ... The Firefox suite includes a web-browser based on ...
-
On evaluating web search with very few relevant documents -
I Soboroff - Proceedings of the 27th annual international conference on …, 2004 - portal.acm.org
... page duplication,there can be more than one target doc- ument,in ... is used,an abso-
lute difference of 0.07 is required ... and seeks a small set of key web sites on ...
Evaluating strategies for similarity search on the web -
TH Haveliwala, A Gionis, D Klein, P Indyk - … of the 11th international conference on World Wide Web, 2002 - portal.acm.org
... built summary of the target doc- ument [1 ... implicit in the hierarchical Web directories
mentioned ... 0.10 /home/gardens/clubs_and_associations 50 0.07 /home/gardens ...
Source: Google Scholar |