xia@lemmy.sdf.orgM to micro-blog-[ish]@lemmy.sdf.orgEnglish · 8 days agoOne would think that all the books that Google has scanned and OCR'd would give them some kind of edge in the LLM-AI department (maybe a virtual research assistant?)message-squaremessage-square2fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareOne would think that all the books that Google has scanned and OCR'd would give them some kind of edge in the LLM-AI department (maybe a virtual research assistant?)xia@lemmy.sdf.orgM to micro-blog-[ish]@lemmy.sdf.orgEnglish · 8 days agomessage-square2fedilink
minus-squarePhilipTheBucketAlinkfedilinkEnglisharrow-up1·20 hours agoSadly, I think that the volume of books available to scan from all the books of the world is pretty small compared with the galaxy of random typing that is available all over the internet at this point.
minus-squarexia@lemmy.sdf.orgOPMlinkfedilinkEnglisharrow-up1·19 hours agoTo me that seems like it would only increase the collection’s value… one would want to train LLMs on good stuff, instead of garbage.
Sadly, I think that the volume of books available to scan from all the books of the world is pretty small compared with the galaxy of random typing that is available all over the internet at this point.
To me that seems like it would only increase the collection’s value… one would want to train LLMs on good stuff, instead of garbage.