ÀÚµ¿Â÷¿ë À½¼º »ê¾÷ º¸°í¼­(2025³â)
Automotive Voice Industry Report, 2025
»óǰÄÚµå : 1660091
¸®¼­Ä¡»ç : ResearchInChina
¹ßÇàÀÏ : 2025³â 02¿ù
ÆäÀÌÁö Á¤º¸ : ¿µ¹® 340 Pages
 ¶óÀ̼±½º & °¡°Ý (ºÎ°¡¼¼ º°µµ)
US $ 4,300 £Ü 6,052,000
Unprintable PDF (Single User License) help
PDF º¸°í¼­¸¦ 1¸í¸¸ ÀÌ¿ëÇÒ ¼ö ÀÖ´Â ¶óÀ̼±½ºÀÔ´Ï´Ù. Àμ⠺Ұ¡´ÉÇϸç, ÅØ½ºÆ®ÀÇ Copy&Pasteµµ ºÒ°¡´ÉÇÕ´Ï´Ù.
US $ 6,400 £Ü 9,008,000
Printable & Editable PDF (Enterprise-wide License) help
PDF º¸°í¼­¸¦ µ¿ÀÏ ±â¾÷ÀÇ ¸ðµç ºÐÀÌ ÀÌ¿ëÇÒ ¼ö ÀÖ´Â ¶óÀ̼±½ºÀÔ´Ï´Ù. Àμ⠰¡´ÉÇϸç Àμ⹰ÀÇ ÀÌ¿ë ¹üÀ§´Â PDF ÀÌ¿ë ¹üÀ§¿Í µ¿ÀÏÇÕ´Ï´Ù.


Çѱ۸ñÂ÷

1. ÀÚµ¿Â÷¿ë À½¼ºÀÇ Å¾ÀçÀ²Àº 83%ÀÌ ³Ñ°í, °í±Þ À½¼º ±â´ÉÀÇ Å¾ÀçÀ²ÀÌ ´ëÆø Áõ°¡ÇÕ´Ï´Ù.

2024³â 1¿ù-11¿ù, ÀÚµ¿Â÷¿ë À½¼º ½Ã½ºÅÛÀÇ Å¾Àç¼ö´Â 1,676¸¸´ë, žÀçÀ²Àº 83.3%°¡ µÇ¾ú½À´Ï´Ù. 2023³âµµ¿Í ºñ±³Çϸé žÀçÀ²ÀÌ 5%Æ÷ÀÎÆ® »ó½ÂÇß½À´Ï´Ù. ¿¡³ÊÁö À¯Çüº°·Î´Â EREV(Extended-Range Electric Vehicle)ÀÇ Å¾ÀçÀ²ÀÌ °¡Àå ³ô°í, 2024³â 1¿ù-11¿ùÀÇ Å¾ÀçÀ²Àº 100%¿¡ ´ÞÇß½À´Ï´Ù. ÀÌ ¿¡³ÊÁö À¯ÇüÀÇ ´ëÇ¥ÀûÀÎ ¸ðµ¨Àº Li Auto L ½Ã¸®Áî, AITO M ½Ã¸®Áî, Deepal S ½Ã¸®Áî µîÀÔ´Ï´Ù.

À½¼º ±â´ÉÀº 2024³â ¿¬¼Ó ´ëÈ­, ¾¾¾Ø½ºÇÇÅ©, ¿þÀÌÅ©¾÷ ÇÁ¸® ±â´ÉÀÇ Å¾Àç ¼ö¿Í žÀçÀ²ÀÌ Å©°Ô Áõ°¡Çß½À´Ï´Ù.

¾¾¾Ø½ºÇÇÅ© ±â´É¿¡¼­´Â 2024³â 1¿ù-11¿ùÀÇ ´©°è žÀç¼ö´Â 466¸¸´ë, žÀçÀ²Àº 23%·Î 2023³â Åë³â¿¡ ºñÇØ 18%Æ÷ÀÎÆ® Áõ°¡Çß½À´Ï´Ù. ¿¡³ÊÁö À¯Çüº°·Î´Â EREVÀÇ Å¾ÀçÀ²ÀÌ 92.1%·Î °¡Àå ³ô°í ¿¬·áÂ÷´Â 7.1%·Î °¡Àå ³·½À´Ï´Ù. °¡°Ý´ëº°·Î´Â 50¸¸ À§¾È À̻󿡼­´Â ¾¾¾Ø½ºÇÇÅ© ±â´ÉÀÇ Å¾ÀçÀ²ÀÌ °¡Àå ³ô°í Zeekr 009, Yangwang U8, NIO ES8 µîÀÌ ´ëÇ¥ÀûÀÎ ¸ðµ¨ÀÔ´Ï´Ù. ¶Ç, ÀÌ °¡°Ý´ëÀÇ Å¾ÀçÀ² Áõ°¡ÆøÀº °¡Àå Å©°í, 48%Æ÷ÀÎÆ® Áõ°¡Çß½À´Ï´Ù. ÀÌ´Â 2024³â¿¡ ÀÚµ¿Â÷ À½¼º ½Ã½ºÅÛÀÇ Áö´É ¼öÁØÀÌ Å©°Ô Çâ»óµÊÀ» º¸¿©ÁÖ¾ú½À´Ï´Ù.

2. Á¶Á¾¼®Àº ´õ ¸¹Àº »ýÅ ÀÚ¿ø¿¡ ¾×¼¼½ºÇϰí À½¼º ¾î½Ã½ºÅÏÆ®´Â ½ÉÃþ ¼­ºñ½º ±â´ÉÀ» È®º¸ÇÕ´Ï´Ù.

±âº» ¸ðµ¨ ½Ã´ë¿¡ "¸¹Àº °ÍÀ» ¾Ë°í ¼­ºñ½º¸¦ Á¦°øÇÒ ¼ö ÀÖ´Â" À½¼º ¾î½Ã½ºÅÏÆ®´Â ´Ù¾çÇÑ »ýÅÂ°è ¿ëµµ¿¡ ´ëÇÑ ¾×¼¼½º¿¡ Å©°Ô ÀÇÁ¸ÇÕ´Ï´Ù. ¿¹¸¦ µé¾î, »ç¿ëÀÚ°¡ 'ÀÚµ¿Â÷°¡ ²¨Áú °Í °°´Ù', '¹è°íÆÄ', '¼³³¯¿¡ ¹«¾ùÀ» ÀÔÀ¸¸é ÁÁÀ»±î'¶ó´Â ¸·¿¬ÇÑ ¸í·ÉÀ» ³»¸° °æ¿ì, À½¼º ¾î½Ã½ºÅÏÆ®ÀÇ ÀÀ´ä¿¡´Â Áöµµ, Áö¿ª »ýȰ ¼­ºñ½º, ¿Â¶óÀÎ Á¤º¸ µîÀÇ ¿ëµµÀ¸·ÎºÎÅÍÀÇ Áö¿øÀÌ ÇÊ¿äÇÕ´Ï´Ù.

AMAP, iQiyi, Tencent Video, NetEase Cloud Music, QQ Music°ú °°Àº ÀϹÝÀûÀÎ ¿ëµµ ¿Ü¿¡µµ Li Auto´Â Xiaohongshu(Little Red Book) Ç÷§ÆûÀÇ ÄÁÅÙÃ÷¿¡ ´ëÇÑ À½¼º ÅëÈ­¸¦ ±¸ÇöÇϰí MeituanÀ» À§ÇØ ±íÀº ¸ÂÃãÇü À½¼º ±â¼úÀ» ½ÃÀÛÇß½À´Ï´Ù. ¿¹¸¦ µé¾î 'Xiaohongshu°¡ ÃßõÇÏ´Â ¼³³¯ º¹Àå', 'Xiaohongshu¿¡¼­ º£ÀÌ¡ ¿©Çà °¡À̵å ã±â', 'Meituan¿¡¼­ Æò±Õ °¡°Ý 200À§¾È, ÆòÁ¡ 4.5ÃÊÀÇ ±¤µ¿ ¿ä¸® ·¹½ºÅä¶û ã±â' µîÀ» ¿äûÇϱâ À§ÇØ »ç¿ëÀÚ´Â Lixiang Tongxue¸¦ ½ÃÀÛÇÒ ¼ö ÀÖ½À´Ï´Ù.

3. ÆÄ¿îµ¥ÀÌ¼Ç ¸ðµ¨ ¾ÖÇø®ÄÉÀ̼ÇÀº '¸í·É »óÈ£ ÀÛ¿ë'¿¡¼­ 'ÀÎÁö »óÈ£ ÀÛ¿ë'À¸·Î ÀÚµ¿Â÷ À½¼º °³¹ßÀ» °¡¼ÓÈ­ÇÕ´Ï´Ù.

±âÁ¸ÀÇ ¸í·É »óÈ£ ÀÛ¿ë°ú´Â ´Þ¸®, ±â¹Ý ¸ðµ¨¿¡ ÀÇÇØ °­È­µÈ ÀÚµ¿Â÷ À½¼º ½Ã½ºÅÛÀº ¸»Çϱâ ÀÌÇØ, ³í¸®Àû Ãß·Ð, Áö½Ä Q&A, ȸȭ ÀÛ¼º, Â÷·® ÁÖº¯ ȯ°æ ÀνĿ¡ º¸´Ù ¿ì¼öÇÑ ´É·ÂÀ» ¹ßÈÖÇÕ´Ï´Ù.

¿¹¸¦ µé¾î, XPengÀÇ XGPT¸¦ žÀçÇÑ Xiao P ¾î½Ã½ºÅÏÆ®´Â À½¼º ¾ð¾î ÀÌÇØ, ³í¸®Àû Ãß·Ð, Áö½Ä ¹é°ú »çÀü, ȸȭ¡¤À̾߱⡤µ¿È­ÀÇ ÀÛ¼º, Â÷·® ÁÖÀ§ÀÇ ¹°Ã¼ÀÇ ÀÎ½Ä µîÀÇ ±â´ÉÀ» °®Ãß°í ÀÖ½À´Ï´Ù.

Li AutoÀÇ Mind GPT¸¦ žÀçÇÑ Lixiang Tongxue´Â Lixiang Tongxue¿¡ ¡¸¿µÈ­ÀÇ À̸§À» Àؾú½À´Ï´Ù. ÈæÀÎÀÇ ÇǾƴϽºÆ®°¡ ³ª¿À´Âµ¥, ¹«½¼ ¿µÈ­ÀÎÁö ¾Ë°Ú¾î?¡¹¶ó°í ¹°¾îº¸´Â ¸ðÈ£ÇÑ °Ë»ö ±â´É, Lixiang Tongxue°¡ ¿µÈ­ÀÇ ¼ö ÀÖ½À´Ï´Ù.

ÀÌ º¸°í¼­´Â Áß±¹ÀÇ ÀÚµ¿Â÷ »ê¾÷¿¡ ´ëÇÑ Á¶»ç ºÐ¼®À» ÅëÇØ ÀÚµ¿Â÷ À½¼º ½Ã½ºÅÛÀÇ Å¾Àç »óȲ, OEM ¹× °ø±Þ¾÷ü, »ê¾÷ üÀÎ, °³¹ß µ¿Ç⠵ ´ëÇÑ Á¤º¸¸¦ Á¦°øÇÕ´Ï´Ù.

¸ñÂ÷

Á¦1Àå ÀÚµ¿Â÷¿ë À½¼º »ê¾÷ÀÇ °³¿ä

Á¦2Àå OEMÀÇ ÀÚµ¿Â÷ À½¼º ½Ã½ºÅÛ ÀÌ¿ë

Á¦3Àå ÀÚµ¿Â÷¿ë À½¼º °ø±ÞÀÚ

Á¦4Àå ÀÚµ¿Â÷¿ë À½¼ºÀÇ »ê¾÷ üÀÎ

Á¦5Àå ÀÚµ¿Â÷¿ë À½¼ºÀÇ °³¹ß µ¿Çâ

KTH
¿µ¹® ¸ñÂ÷

¿µ¹®¸ñÂ÷

Automotive voice research: high-level voice function installation rate significantly increases, automotive voice moves towards "cognitive interaction"

From January to November 2024, installations of automotive voice systems reached 16.76 million units, with an installation rate of 83.3%. Compared to the full year of 2023, installations increased by 5 percentage points. By energy type, EREV (Extended-Range Electric Vehicle) had the highest installation rate for automotive voice systems, reaching 100% from January to November 2024. Typical models under this energy type include the Li Auto L series, AITO M series, and Deepal S series.

In terms of voice function, installations and installation rate for continuous dialogue, see-and-speak, and wake-up-free functions greatly increased in 2024.

For the see-and-speak function, from January to November 2024, its installations reached 4.66 million units, with an installation rate of 23%, an increase of 18 percentage points compared to the full year of 2023. By energy type, EREV had the highest installation rate at 92.1%, while fuel vehicles had the lowest at only 7.1%. By price range, the "see-and-speak" function had the highest installation rate in the over 500,000 RMB range, with representative models such as Zeekr 009, Yangwang U8, and NIO ES8. This range also saw the largest increase in installation rate, up by 48 percentage points. This also indicates a significant improvement in the intelligence level of automotive voice systems in 2024.

2. The cockpit accesses more ecological resources, voice assistants gain deep service capabilities

In the era of foundation models, a voice assistant that "knows a lot and can serve" relies more on the access to diverse ecological applications. For example, when users issue vague commands such as "the car is almost out of power," "I'm hungry," or "what should I wear for the Chinese New Year," the voice assistant's response requires support from applications like maps, local life services, and online information.

In addition to common applications like AMAP, iQiyi, Tencent Video, NetEase Cloud Music, and QQ Music, Li Auto has implemented voice calls to Xiaohongshu (Little Red Book) platform content and launched a deeply customized voice skill for Meituan. For example, users can wake up Lixiang Tongxue to ask " Chinese New Year outfits recommended by Xiaohongshu," "find a Beijing travel guide on Xiaohongshu," or "help me find a Cantonese restaurant on Meituan with an average price of 200 RMB and a rating above 4.5."

3. Foundation model applications accelerate the development of automotive voice from "command interaction" to "cognitive interaction"

Different from the previous command-based interaction, automotive voice systems empowered by foundation models have better capabilities in spoken language understanding, logical reasoning, knowledge Q&A, painting creation, and perceiving the vehicle's surrounding environment. For example:

XPeng's XGPT-powered Xiao P assistant has capabilities in spoken language understanding, logical reasoning, knowledge encyclopedia, painting & story & fairy tale creation, and recognizing objects around the vehicle.

Li Auto's Mind GPT-powered Lixiang Tongxue has fuzzy search capabilities, such as asking Lixiang Tongxue "I forgot the name of a movie, there's a black pianist, do you know what it is?"; search by image description, where Lixiang Tongxue can read movie poster content and express it freely, allowing children who cannot read to choose movies by describing the poster.

Xiaoai Tongxue's application of foundation models also gives it the ability to understand and respond to vague commands. For example, it can recognize and respond to commands like "Where is my phone?", "Turn off the lights at home", "What mountain is that ahead?", and "What car is that ahead?".

Taking XPeng Motors as an example, XPeng Motors has developed its own XGPT (Lingxi) foundation model and integrated it into the voice system. Additionally, it has integrated the ZhiPu AI base foundation model and multimodal models, giving the voice assistant Xiao P stronger language understanding, image recognition, and generation capabilities, which can be linked with in-vehicle perception system and external environment.

4. AI foundation models become a must-have for OEMs to build intelligent automotive voice systems

By 2024, the number of brands equipping their intelligent cockpits with foundation models has significantly increased, with Chinese independent brands being the primary drivers of this trend. Some brands have already completed the development path from cooperative supply to joint R&D, and finally to independent research. For example, in January 2024, Geely applied Baidu's ERNIE Bot foundation model in its Galaxy L6. In the same month, Geely released its self-developed full-scenario AI foundation model-Geely Xingrui AI Foundation Model.

Based on the Xingrui AI Foundation Model architecture, Geely has also developed derivative models such as the Xingrui NLP Language Foundation Model and the Xingrui Multimodal Foundation Model. Among these, the Xingrui NLP Language Foundation Model is entirely self-developed by the Xingrui Intelligent Computing Center, with a total training data volume exceeding 3 trillion tokens. It includes an emotional module, enabling excellent logical reasoning and contextual memory capabilities, allowing for human-like emotional interactions.

In January 2025, Geely showcased its development path for an in-cabin intelligent assistant based on the Xingrui AI Foundation Model at CES 2025-moving from "Assisted Intelligence" to "Agent Intelligence" and finally to "Autonomous Intelligence." With the support of the foundation model, in-car assistant will evolve from "accurately responding to commands" to "understanding the environment and autonomously completing tasks," and ultimately to "possessing self-awareness and autonomous emotional capabilities."

Chinese independent brands such as BYD, SAIC, Dongfeng, GAC, Changan, Chery, and emerging OEMs like NIO, Li Auto, XPeng, AITO, and Xiaomi have also implemented AI foundation models in automotive voice systems. As automotive intelligence enters its second phase, AI foundation models are gradually becoming a necessary option for building intelligent voice interaction systems.

Table of Contents

Related Definitions

1 Overview of Automotive Voice Industry

2 OEM Applications of Automotive Voice Systems

3 Automotive Voice Suppliers

4 Automotive Voice Industry Chain

5 Automotive Voice Development Trends

(ÁÖ)±Û·Î¹úÀÎÆ÷¸ÞÀÌ¼Ç 02-2025-2992 kr-info@giikorea.co.kr
¨Ï Copyright Global Information, Inc. All rights reserved.
PC¹öÀü º¸±â