{"id":20195,"date":"2025-03-19T19:54:59","date_gmt":"2025-03-19T19:54:59","guid":{"rendered":"https:\/\/www.hotelsalepage.com\/feed\/cision-pr-newswire\/announcing-the-multilingual-conversational-speech-language-model-mlc-slm-challenge\/"},"modified":"2025-03-19T19:54:59","modified_gmt":"2025-03-19T19:54:59","slug":"announcing-the-multilingual-conversational-speech-language-model-mlc-slm-challenge","status":"publish","type":"post","link":"https:\/\/thaipropertynews.com\/feeds\/?p=20195","title":{"rendered":"Announcing the Multilingual Conversational Speech Language Model (MLC-SLM) Challenge"},"content":{"rendered":"<p><span class=\"legendSpanClass\"><span class=\"xn-location\">MONROVIA, Calif.<\/span><\/span>, <span class=\"legendSpanClass\"><span class=\"xn-chron\">March 19, 2025<\/span><\/span> \/PRNewswire\/ &#8212; Nexdata, a leading global provider of AI data services today announces the start of The Multilingual Conversational Speech LLM (MLC-SLM) Challenge, an officially approved satellite event of Interspeech 2025.<\/p>\n<p>This challenge, hosted by Meta, Google, Samsung, Naver, China Mobile, Northwestern Polytechnical University and Nexdata, aims to advance multilingual conversational speech AI by providing a real-world dataset and encouraging innovation in speech language models.<\/p>\n<p>The challenge consists of two tasks, both of which require participants to explore the development of speech language models (SLMs):<\/p>\n<p><b>Task I: Multilingual Conversational Speech Recognition<\/b><\/p>\n<p>Objective: Develop a multilingual LLM-based ASR model. Participants will be provided with oracle segmentation and speaker labels for each conversation.<\/p>\n<p><b>Task II: Multilingual Conversational Speech Diarization and Recognition<\/b><\/p>\n<p>Objective: Develop a system for both speaker diarization (identifying who is speaking when), and recognition (transcribing speech to text). No prior or oracle information will be provided during evaluation (e.g., no pre-segmented utterances or speaker labels). Both pipeline-based and end-to-end systems are encouraged, providing flexibility in system design and implementation.<\/p>\n<p>The training set (Train) comprises approximately 11 languages: English (en), French (fr), <span class=\"xn-person\">German (de)<\/span>, Italian (it), Portuguese (pt), Spanish (es), Japanese (jp), Korean (ko), Russian (ru), Thai (th), Vietnamese (vi). It&#8217;s designed to provide a rich resource for training and evaluating multilingual conversational speech language models (MLC-SLM), addressing the challenges of linguistic diversity, speaker variability, and contextual understanding.<\/p>\n<p><b>Important Dates (AOT Time)<\/b><\/p>\n<p><span class=\"xn-chron\">March 10, 2025<\/span>: Registration opens<br \/><span class=\"xn-chron\">March 15, 2025<\/span>: Training data release<br \/><span class=\"xn-chron\">March 20, 2025<\/span>: Development set and baseline system release<br \/><span class=\"xn-chron\">May 15, 2025<\/span>: Evaluation set release and Leaderboard open<br \/><span class=\"xn-chron\">May 30, 2025<\/span>: Leaderboard freeze and paper submission portal opens (CMT system)<br \/><span class=\"xn-chron\">June 15, 2025<\/span>: Paper submission deadline<br \/><span class=\"xn-chron\">July 1, 2025<\/span>: Notification of acceptance<br \/><span class=\"xn-chron\">August 18, 2025<\/span>: Workshop date<\/p>\n<p>We have set a prize pool of <span class=\"xn-money\">$20,000<\/span> for the winners. Based on performance, the top three teams in each track will be awarded:<br \/>1st Prize: <span class=\"xn-money\">$5,000<\/span><br \/>2nd Prize: <span class=\"xn-money\">$3,000<\/span><br \/>3rd Prize: <span class=\"xn-money\">$2,000<\/span><\/p>\n<p>For more details, please check out the challenge website: <a href=\"https:\/\/www.nexdata.ai\/competition\/mlc-slm\" target=\"_blank\" rel=\"nofollow\">https:\/\/www.nexdata.ai\/competition\/mlc-slm<\/a>\u00a0<\/p>\n<p>Participate here: <a href=\"https:\/\/docs.google.com\/forms\/d\/e\/1FAIpQLSftZCRQQWvO5NZd-bPo1VT2Xsaieu_ZYCklw6MhW6LqjWnuYQ\/viewform?usp=send_form\" target=\"_blank\" rel=\"nofollow\">https:\/\/docs.google.com\/forms\/d\/e\/1FAIpQLSftZCRQQWvO5NZd-bPo1VT2Xsaieu_ZYCklw6MhW6LqjWnuYQ\/viewform?usp=send_form<\/a>\u00a0<\/p>\n<p class=\"prntaj\"><span>For inquiries:\u00a0<a href=\"mailto:mlc-slmw@nexdata.ai\" target=\"_blank\" rel=\"nofollow\">mlc-slmw@nexdata.ai<\/a>\u00a0<\/span><\/p>\n<p>Join us in shaping the future of multilingual conversational AI and be part of this groundbreaking challenge!<\/p>\n<p><b>About Nexdata<\/b><\/p>\n<p>Nexdata provides top-notch training data solutions and serves as your reliable partner. With an extensive array of off-the-shelf datasets and flexible data collection and annotation services, our mission revolves around unleashing AI&#8217;s full potential and expediting the AI industry&#8217;s growth.<\/p>","protected":false},"excerpt":{"rendered":"<p><!-- wp:html --><\/p>\n<p><span class=\"legendSpanClass\"><span class=\"xn-location\">MONROVIA, Calif.<\/span><\/span>, <span class=\"legendSpanClass\"><span class=\"xn-chron\">March 19, 2025<\/span><\/span> \/PRNewswire\/ &#8212; Nexdata, a leading global provider of AI data services today announces the start of The Multilingual Conversational Speech LLM (MLC-SLM) Challenge, an officially approved satellite event of Interspeech 2025.<\/p>\n<p>This challenge, hosted by Meta, Google, Samsung, Naver, China Mobile, Northwestern Polytechnical University and Nexdata, aims to advance multilingual conversational speech AI by providing a real-world dataset and encouraging innovation in speech language models.<\/p>\n<p>The challenge consists of two tasks, both of which require participants to explore the development of speech language models (SLMs):<\/p>\n<p><b>Task I: Multilingual Conversational Speech Recognition<\/b><\/p>\n<p>Objective: Develop a multilingual LLM-based ASR model. Participants will be provided with oracle segmentation and speaker labels for each conversation.<\/p>\n<p><b>Task II: Multilingual Conversational Speech Diarization and Recognition<\/b><\/p>\n<p>Objective: Develop a system for both speaker diarization (identifying who is speaking when), and recognition (transcribing speech to text). No prior or oracle information will be provided during evaluation (e.g., no pre-segmented utterances or speaker labels). Both pipeline-based and end-to-end systems are encouraged, providing flexibility in system design and implementation.<\/p>\n<p>The training set (Train) comprises approximately 11 languages: English (en), French (fr), <span class=\"xn-person\">German (de)<\/span>, Italian (it), Portuguese (pt), Spanish (es), Japanese (jp), Korean (ko), Russian (ru), Thai (th), Vietnamese (vi). It&#8217;s designed to provide a rich resource for training and evaluating multilingual conversational speech language models (MLC-SLM), addressing the challenges of linguistic diversity, speaker variability, and contextual understanding.<\/p>\n<p><b>Important Dates (AOT Time)<\/b><\/p>\n<p><span class=\"xn-chron\">March 10, 2025<\/span>: Registration opens<br \/><span class=\"xn-chron\">March 15, 2025<\/span>: Training data release<br \/><span class=\"xn-chron\">March 20, 2025<\/span>: Development set and baseline system release<br \/><span class=\"xn-chron\">May 15, 2025<\/span>: Evaluation set release and Leaderboard open<br \/><span class=\"xn-chron\">May 30, 2025<\/span>: Leaderboard freeze and paper submission portal opens (CMT system)<br \/><span class=\"xn-chron\">June 15, 2025<\/span>: Paper submission deadline<br \/><span class=\"xn-chron\">July 1, 2025<\/span>: Notification of acceptance<br \/><span class=\"xn-chron\">August 18, 2025<\/span>: Workshop date<\/p>\n<p>We have set a prize pool of <span class=\"xn-money\">$20,000<\/span> for the winners. Based on performance, the top three teams in each track will be awarded:<br \/>1st Prize: <span class=\"xn-money\">$5,000<\/span><br \/>2nd Prize: <span class=\"xn-money\">$3,000<\/span><br \/>3rd Prize: <span class=\"xn-money\">$2,000<\/span><\/p>\n<p>For more details, please check out the challenge website: <a href=\"https:\/\/www.nexdata.ai\/competition\/mlc-slm\" target=\"_blank\" rel=\"nofollow\">https:\/\/www.nexdata.ai\/competition\/mlc-slm<\/a>\u00a0<\/p>\n<p>Participate here: <a href=\"https:\/\/docs.google.com\/forms\/d\/e\/1FAIpQLSftZCRQQWvO5NZd-bPo1VT2Xsaieu_ZYCklw6MhW6LqjWnuYQ\/viewform?usp=send_form\" target=\"_blank\" rel=\"nofollow\">https:\/\/docs.google.com\/forms\/d\/e\/1FAIpQLSftZCRQQWvO5NZd-bPo1VT2Xsaieu_ZYCklw6MhW6LqjWnuYQ\/viewform?usp=send_form<\/a>\u00a0<\/p>\n<p class=\"prntaj\"><span>For inquiries:\u00a0<a href=\"mailto:mlc-slmw@nexdata.ai\" target=\"_blank\" rel=\"nofollow\">mlc-slmw@nexdata.ai<\/a>\u00a0<\/span><\/p>\n<p>Join us in shaping the future of multilingual conversational AI and be part of this groundbreaking challenge!<\/p>\n<p><b>About Nexdata<\/b><\/p>\n<p>Nexdata provides top-notch training data solutions and serves as your reliable partner. With an extensive array of off-the-shelf datasets and flexible data collection and annotation services, our mission revolves around unleashing AI&#8217;s full potential and expediting the AI industry&#8217;s growth.<\/p>\n<p><!-- \/wp:html --><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"rop_custom_images_group":[],"rop_custom_messages_group":[],"rop_publish_now":"initial","rop_publish_now_accounts":[],"rop_publish_now_history":[],"rop_publish_now_status":"pending","footnotes":""},"categories":[5,7],"tags":[],"class_list":["post-20195","post","type-post","status-publish","format-standard","hentry","category-cision-pr-newswire","category-cision-pr-newswire-en"],"_links":{"self":[{"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=\/wp\/v2\/posts\/20195","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=20195"}],"version-history":[{"count":0,"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=\/wp\/v2\/posts\/20195\/revisions"}],"wp:attachment":[{"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=20195"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=20195"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/thaipropertynews.com\/feeds\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=20195"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}