|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Google data collected by
Jonathan Lansey with an automatic program written by Etan Bukiet. Data from
Yahoo was collected with the help of Virgil Griffith. |
|
|
|
|
Sheet # |
Description |
|
|
|
0 |
You’re here, feast your eyes |
|
|
|
1 |
Data used for figs.3, 4. Googlewhacks taken from googlewhack.com |
|
|
|
2,3 |
Data used for fig. 5, 7 and 10 Combinations of words from the
googlewhack vocabulary |
|
|
|
4 |
Data used for fig. 6, Random selections from pairs on sheet 3 with a
third common word added from a list (triplets) |
|
|
|
5 |
Data used for fig 8. Yahoo and Google Comparison |
|
|
|
6 |
Determining the magnitude of the spread with the advanced search
options (from the discussion section) |
|
|
|
7 |
Switching word pairs, like Goddamned Toolboxes, and comparing results
(also from discussion) |
|
|
|
8 |
Data used for fig. 1, from the computational model used to verify the
linear approximation |
|
|
|
9 |
Strength of Associativity data collected from these worksheets and used
in Table 2 |
|
|
|
|
|
|
|
|
|
Format Key: |
|
|
|
|
|
|
|
|
a |
word a |
|
|
b |
word b |
|
|
c |
word c |
|
|
A |
Number of results from a search for word a |
|
|
B |
Number of results from a search for word b |
|
|
C |
Number of results from a search for word c |
|
|
A+B |
Number of results returned from words a and b
together in the same search |
|
|
A+B+C |
Number of results returned from words a,b and c together in the same search |
|
|
|
|
|
|
|
|
|
|
|
Values for I (index size) |
|
|
|
I |
8E+09 |
|
|
I(eff2) |
9E+08 |
|
|
I(eff3) |
31600000 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Copyright Jonathan C.
Lansey (2009) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|