Apple has refuted using unethically obtained data to train Apple Intelligence — but it has acknowledged its use for another project. On Tuesday, it was learned that an AI research lab called EleutherAI had harvested subtitles from YouTube videos without express permission from the creators. It also gathered data from Wikipedia, the English Parliament, and Enron staff emails. The data was then added to a dataset called “the Pile.”
