One would think that all the books that Google has scanned and OCR'd would give them some kind of edge in the LLM-AI department (maybe a virtual research assistant?)

xia@lemmy.sdf.org · 8 days ago

One would think that all the books that Google has scanned and OCR'd would give them some kind of edge in the LLM-AI department (maybe a virtual research assistant?)

PhilipTheBucket · 20 hours ago

Sadly, I think that the volume of books available to scan from all the books of the world is pretty small compared with the galaxy of random typing that is available all over the internet at this point.

xia@lemmy.sdf.org · 19 hours ago

To me that seems like it would only increase the collection’s value… one would want to train LLMs on good stuff, instead of garbage.