Also, the notion that you can’t have fantastic results with generative models training on only content you have permission to use is ridiculous. OpenAI and Meta are bad actors that are disingenuous. It’s *easier* to get good results with ethical shortcuts, but you can achieve amazing results without stealing.
@cleverdevil This is 100% correct. Properly curated training data (which we have not yet seen) will yield dramatically better LLM results. Not more reliable, mind you, but it should avoid some of the creepy and dark stuff we have seen emerge. Curating requires humans and will be expensive.
fgtech, Jun 12 2024 on micro.blog