Wikipedia and Google delude many into thinking all human knowledge has been made available to our immediate access.
This is a dangerous lie.
Any scholar will tell you of the vast amount of documents and artifacts completely unavailble online, or indeed impossible to easily scan or study online.
And this silent wealth pales before the oral and analog culture which, in a McLuhanian dimension, can not be simply digitized and maintain integrity.
@Shufei when I was a journalist, investigative projects were dependent on examination of paper, of seeing associations and connections you wouldn't see otherwise. data is nice but has caveats. with a browser search you replace your own brain and eyes with someone else's unknown algorithm sweeping unknown data which, as you correctly point out, is limited. To me, nothing more fun than to go through Registry of Deeds books looking for hidden shell corporations! cheers, g
A good point to stress. These algorithms can both intentionally obfuscate and obfuscate via the "wisdom" of crowds, intensifying biases and received opinions.
I'm further concerned about the dubious fidelity of much OCR texts. Many Chinese manuscripts have enormously high error rates when so scanned, I've found, rendering them only useful as a first run search, and certainly not for reading.
"I appreciate SDF but it's a general-purpose server and the name doesn't make it obvious that it's about art." - Eugen Rochko