MLCommons and Hugging Face Release a Large-Scale Speech Dataset for AI Research

MLCommons and Hugging Face team up to release massive speech dataset for AI research | TechCrunch

The nonprofit AI safety org MLCommons has teamed up with Hugging Face to release a public domain dataset of speech recor...

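MLCommons and Hugging Face have partnered to release "Unsupervised People’s Speech," a large-scale speech dataset for AI research. The dataset contains more than one million hours of audio across 89 languages, collected mainly from public-domain sources. The data is provided under a license that permits both commercial and academic use.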

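Ethical concerns have also been raised about the dataset, because the people who were recorded may not know that their voices are being used for AI research. MLCommons has committed to maintaining and improving the quality of the data, but researchers should exercise caution when using it.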

Unsupervised People’s Speech:
https://huggingface.co/datasets/MLCommons/unsupervised_peoples_speech
