Although we've done amazing things like creating the library at Alexandria and great libraries from time to time ever since, we're really in a dark age of folly now.
We have desktop data stores of 10-100+ terabyte scale being moderately common, commercial ones hundreds of thousands times larger.
An average PC may well have space to hold more text that a human could ever read in their lifetime if they did nothing otherwise with their abilities.
And yet we're really phenomally sucking at creating ORGANIZED libraries of human knowledge. We create the internet yet we search all that with, what,
google or worse web sites without the most basic rudiment of actual organization of the content or advanced capacities of query?
We're creating personal "AI"s and cloud based ones too with like what 4k-200k "tokens" of context / prompt memory and nothing more?
Sure there's RAG and stuff but it seems pathetic.
We shouldn't have to be "digging through search results" we should have efficient ways to have organized the "sum of human knowledge" and advanced ways to search it.
Nevertheless there'd still be a lot to search, petabytes, whatever. But the one thing our "digital society" should be good at -- organizing / sharing information
we're not (besides in the domain of actual libraries and databases) and the one thing ML "assistants" should be good at -- NAVIGATING this mass of
human knowledge and doing specific research based on synthesizing, searching, correlating it -- they SUCK at.
I'd take SQL or some NOSQL query as a vast improvement over Google search but we don't even have that.
And out ML models (in the breadth of them) can't even manage XML / JSON / PDF et. al. well without a lot of poorly working layers of things to force them to kinda sorta work.
Originally posted by Chugworth
View Post
Leave a comment: