Research Report on AI Foundation Models and Their Applications in Automotive Field, 2024-2025
Product code: 1660087
Publisher: ResearchInChina
Publication date: February 2025
Page info: English, 340 pages
License & price (VAT not included)
US $ 4,300 ￦ 6,221,000
Unprintable PDF (Single User License)
This license allows one person to use the PDF report. The report cannot be printed, and its text cannot be copied and pasted.
US $ 6,400 ￦ 9,260,000
Printable & Editable PDF (Enterprise-wide License)
This license allows everyone in the same company to use the PDF report. The report can be printed, and printed copies may be used within the same scope as the PDF.


This report investigates China's automotive industry and provides information on AI foundation models, including an overview, types, common technologies, companies, and application cases in vehicles.


Research on AI foundation models and automotive applications: reasoning, cost reduction, and explainability

Reasoning capabilities drive up the performance of foundation models.

Since the second half of 2024, foundation model companies inside and outside China have launched reasoning models, using reasoning frameworks such as Chain-of-Thought (CoT) to strengthen the ability of foundation models to handle complex tasks and make decisions independently.

This wave of reasoning-model releases aims to improve how foundation models handle complex scenarios and to lay the groundwork for Agent applications. In the automotive industry, stronger reasoning can address pain points in AI applications, for example by improving cockpit assistants' intent recognition for semantically complex requests and the accuracy of spatiotemporal prediction in autonomous driving planning and decision-making.

In 2024, the reasoning technologies of mainstream foundation models deployed in vehicles revolved primarily around CoT and its variants, e.g., Tree-of-Thought (ToT), Graph-of-Thought (GoT), and Forest-of-Thought (FoT). Depending on the scenario, these were combined with generative models (e.g., diffusion models), knowledge graphs, causal reasoning models, cumulative reasoning, and multimodal reasoning chains.

For example, the Modularized Thinking Language Model (MeTHanol) proposed by Geely allows foundation models to synthesize human thoughts and use them to supervise the hidden layers of LLMs. By adapting to daily conversations and personalized prompts, it generates human-like thinking behaviors, enhances the thinking and reasoning capabilities of large language models, and improves explainability.

In 2025, the focus of reasoning technology will shift to multimodal reasoning. Common training techniques include instruction fine-tuning, multimodal in-context learning, and multimodal CoT (M-CoT); they are often enabled by combining multimodal fusion alignment with LLM reasoning technologies.
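As a rough illustration of how a tree-style variant such as ToT differs from a single linear chain, the sketch below searches over partial "thoughts" level by level and keeps only the best few at each step. It is a toy under stated assumptions, not any vendor's implementation: the problem, the `expand` generator, and the `score` evaluator are all hypothetical stand-ins for what would be foundation model calls in a real system.

```python
from typing import Callable

State = tuple[int, ...]  # a partial chain of "thoughts"; here, the numbers chosen so far


def tree_of_thought_search(
    start: State,
    expand: Callable[[State], list[State]],
    score: Callable[[State], float],
    beam_width: int,
    depth: int,
) -> State:
    """Breadth-first search over thought states, keeping the best few per level."""
    frontier = [start]
    for _ in range(depth):
        children = [child for state in frontier for child in expand(state)]
        if not children:
            break
        # Keep only the highest-scoring partial solutions (the "beam").
        frontier = sorted(children, key=score, reverse=True)[:beam_width]
    return max(frontier, key=score)


# Hypothetical toy problem: pick three numbers from CHOICES that sum to TARGET.
CHOICES = (1, 3, 5, 7)
TARGET = 15


def expand(state: State) -> list[State]:
    """Propose every possible next thought (stand-in for a model's proposals)."""
    return [state + (c,) for c in CHOICES]


def score(state: State) -> float:
    """Rate a partial solution (stand-in for a model's self-evaluation)."""
    return -abs(TARGET - sum(state))


if __name__ == "__main__":
    best = tree_of_thought_search((), expand, score, beam_width=3, depth=3)
    print(best, sum(best))
```

Setting `beam_width=1` reduces the search to following one branch at a time, which is roughly the shape of a plain linear chain.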

Explainability bridges trust between AI and users.

Before users experience the "usefulness" of AI, they need to trust it. In 2025, the explainability of AI systems therefore becomes a key factor in increasing the user base of automotive AI. This challenge can be addressed by demonstrating long CoT.

The explainability of AI systems can be achieved at three levels: data explainability, model explainability, and post-hoc explainability.

Li Auto, for example, uses "AI reasoning visualization technology" in its L3 autonomous driving to present the thinking process of its end-to-end + VLM models intuitively. The visualization covers the entire process, from perception input from the physical world to the driving decision output by the foundation model, which enhances users' trust in intelligent driving systems.

Li Auto's "AI reasoning visualization technology" has three parts:

The attention system displays the traffic and environmental information perceived by the vehicle, evaluates the behavior of traffic participants in real-time video streams, and uses heatmaps to highlight the evaluated objects.

The end-to-end (E2E) model displays the thinking process behind its driving trajectory output. The model considers different driving trajectories, presents 10 candidate outputs, and finally adopts the most likely one as the driving path.

The vision language model (VLM) displays its perception, reasoning, and decision-making processes through dialogue.
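The candidate-selection step described for the E2E model can be sketched as follows. This is a minimal illustration only: the `Candidate` type, the straight-line waypoints, and the scoring rule are hypothetical, and the report does not describe how Li Auto actually scores trajectories.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """One hypothetical candidate trajectory with a model-assigned score."""
    name: str
    waypoints: list[tuple[float, float]]  # (x, y) positions in metres
    score: float  # unnormalised likelihood; higher means more likely


def select_trajectory(candidates: list[Candidate]) -> Candidate:
    """Return the candidate with the highest score."""
    if not candidates:
        raise ValueError("at least one candidate trajectory is required")
    return max(candidates, key=lambda c: c.score)


def make_demo_candidates(n: int = 10) -> list[Candidate]:
    """Build n straight-line candidates with different lateral offsets.

    The scores are made up for illustration: candidates closer to the
    lane centre (offset 0) receive a higher score.
    """
    candidates = []
    for i in range(n):
        offset = (i - n // 2) * 0.5  # lateral offset in metres
        waypoints = [(float(step), offset) for step in range(5)]
        candidates.append(Candidate(f"traj_{i}", waypoints, score=-abs(offset)))
    return candidates


if __name__ == "__main__":
    best = select_trajectory(make_demo_candidates())
    print(best.name, best.waypoints)
```

A visualization layer could draw all ten candidates and highlight the one returned by `select_trajectory`, which matches the display behavior the summary describes.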

The dialogue interfaces of various reasoning models likewise use a long CoT to break down the reasoning process. DeepSeek R1, for example, first presents the decision at each node through a CoT during conversations with users and then provides explanations in natural language.

Additionally, most reasoning models, including Zhipu's GLM-Zero-Preview, Alibaba's QwQ-32B-Preview, and Skywork 4.0 o1, support demonstration of the long CoT reasoning process.

DeepSeek lowers the barrier to introducing foundation models in vehicles, enabling both performance improvement and cost reduction.

Do better reasoning capabilities and overall performance mean higher costs? Not necessarily, as DeepSeek's popularity shows. In early 2025, OEMs started connecting to DeepSeek, primarily to enhance the overall capabilities of their vehicle foundation models in specific applications.

In fact, OEMs had already been developing and iterating on their automotive AI foundation models before the DeepSeek models were launched. For cockpit assistants, some had completed the initial build of their solutions and had either connected to cloud foundation model suppliers for trial operation or provisionally selected suppliers, including Alibaba Cloud, Tencent Cloud, and Zhipu. They connected to DeepSeek in early 2025 because they valued the following:

Strong reasoning performance: for example, the R1 reasoning model is comparable to OpenAI o1 and even excels in mathematical logic.

Lower costs: performance is maintained while training and inference costs are kept at low levels for the industry.

By connecting to DeepSeek, OEMs can reduce the costs of hardware procurement, model training, and maintenance while maintaining performance when deploying intelligent driving and cockpit assistants:

Low computing overhead technologies facilitate high-level autonomous driving and technological equality: high-performance models can be deployed on low-compute automotive chips (e.g., edge computing units), reducing reliance on expensive GPUs. Combined with the DualPipe algorithm and FP8 mixed-precision training, these technologies optimize computing power utilization, allowing mid- and low-end vehicles to deploy high-level cockpit and autonomous driving features and accelerating the popularization of intelligent cockpits.

Enhanced real-time performance. In driving environments, autonomous driving systems must process large amounts of sensor data in real time and cockpit assistants must respond quickly to user commands, while vehicle computing resources are limited. With lower computing overhead, DeepSeek enables faster processing of sensor data, more efficient use of the computing power of intelligent driving chips (DeepSeek achieves 90% utilization of NVIDIA A100 chips during server-side training), and lower latency (e.g., on the Qualcomm 8650 platform, with 100 TOPS of computing power, DeepSeek reduces the inference response time from 20 milliseconds to 9-10 milliseconds). In intelligent driving systems, this helps keep driving decisions timely and accurate, improving driving safety and user experience. In cockpit systems, it helps cockpit assistants respond quickly to voice commands, achieving smooth human-computer interaction.
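To put the quoted latency figures in perspective, the short calculation below converts the drop from 20 milliseconds to 9-10 milliseconds into a percentage reduction and a speed-up factor. The helper name `latency_reduction` is illustrative; only the three latency values come from the text above.

```python
def latency_reduction(before_ms: float, after_ms: float) -> tuple[float, float]:
    """Return (percentage reduction, speed-up factor) for a latency change."""
    if before_ms <= 0 or after_ms <= 0:
        raise ValueError("latencies must be positive")
    reduction_pct = (before_ms - after_ms) / before_ms * 100.0
    speedup = before_ms / after_ms
    return reduction_pct, speedup


if __name__ == "__main__":
    # Figures quoted for the Qualcomm 8650 platform: 20 ms before, 9-10 ms after.
    for after in (9.0, 10.0):
        pct, speedup = latency_reduction(20.0, after)
        print(f"20 ms -> {after:.0f} ms: {pct:.0f}% lower, {speedup:.2f}x faster")
```

Under these figures the response time falls by roughly 50-55%, which corresponds to about a 2.0-2.2x speed-up.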

Table of Contents

Definitions

1 Overview of AI Foundation Models

2 Analysis of AI Foundation Models of Differing Types

3 Common Technologies in AI Foundation Models

4 AI Foundation Model Companies

5 Application Cases of AI Foundation Models in Automotive

6 Application Trends of AI Foundation Models

Global Information, Inc. 02-2025-2992 kr-info@giikorea.co.kr
© Copyright Global Information, Inc. All rights reserved.