GenAI ¸ðµ¨ Æ®·¹ÀÌ´× ³Ê¸Ó : ½ÇÀü ȯ°æ¿¡¼­ AI Ãß·Ð ¿öÅ©·ÎµåÀÇ ºñ¿ë ¹× ·¹ÀÌÅϽÃÀÇ »è°¨°ú È®À强ÀÇ Çâ»ó
Beyond GenAI Model Training: Reducing Cost and Latency and Improving Scalability of AI Inferencing Workloads in Production
»óǰÄÚµå : 1817382
¸®¼­Ä¡»ç : IDC
¹ßÇàÀÏ : 2025³â 09¿ù
ÆäÀÌÁö Á¤º¸ : ¿µ¹® 18 Pages
 ¶óÀ̼±½º & °¡°Ý (ºÎ°¡¼¼ º°µµ)
US $ 7,500 £Ü 10,628,000
PDF (Single User License) help
PDF º¸°í¼­¸¦ 1¸í¸¸ ÀÌ¿ëÇÒ ¼ö ÀÖ´Â ¶óÀ̼±½ºÀÔ´Ï´Ù. Àμâ´Â °¡´ÉÇϸç Àμ⹰ÀÇ ÀÌ¿ë ¹üÀ§´Â PDF ÀÌ¿ë ¹üÀ§¿Í µ¿ÀÏÇÕ´Ï´Ù.


Çѱ۸ñÂ÷

IDC Perspective´Â »ý¼ºÇü AI(GenAI) Ãß·Ð ¿öÅ©·Îµå¸¦ ½ÇÀü ȯ°æ¿¡¼­ È®ÀåÇÒ ¶§ÀÇ °úÁ¦¿Í Çõ½ÅÀ» ޱ¸Çϰí, ºñ¿ë »è°¨, ·¹ÀÌÅϽà °³¼±, È®À强¿¡ ÁßÁ¡À» µÎ°í ÀÖ½À´Ï´Ù. Ãß·Ð ÆÛÆ÷¸Õ½º¸¦ ÃÖÀûÈ­Çϱâ À§ÇÑ ¸ðµ¨ ¾ÐÃà, ¹èġó¸®, ij½Ã, º´·ÄÈ­ µîÀÇ ¹æ¹ý¿¡ ´ëÇØ¼­µµ ÁßÁ¡ÀûÀ¸·Î ´Ù·ç°í ÀÖ½À´Ï´Ù. AWS, DeepSeek, Google, IBM, Microsoft, NVIDIA, Red Hat, Snowflake, WRITER µîÀÇ º¥´õ´Â GenAI Ãß·Ð È¿À²¼º°ú Áö¼Ó°¡´É¼ºÀ» ³ôÀ̱â À§ÇÑ ±â¼ú Çõ½ÅÀ» ÃßÁøÇϰí ÀÖ½À´Ï´Ù. º» ¹®¼­´Â Á¶Á÷ÀÌ Ãß·Ð Àü·«À» »ç¿ë »ç·Ê¿¡ ¸ÂÃß¾î Á¶Á¤Çϰí, Á¤±âÀûÀ¸·Î ºñ¿ëÀ» Àç°ËÅäÇϰí, Àü¹®°¡¿Í Á¦ÈÞÇÏ´Â °ÍÀ¸·Î ½Å·Ú¼º°ú È®À强ÀÌ ¶Ù¾î³­ AI µµÀÔÀ» ½ÇÇöÇϵµ·Ï ¾îµå¹ÙÀ̽ºÇϰí ÀÖ½À´Ï´Ù. "AI Ãß·ÐÀÇ ÃÖÀûÈ­´Â ´Ü¼øÈ÷ ¼Óµµ ¹®Á¦°¡ ¾Æ´Õ´Ï´Ù. ºñ¿ë, È®À强, Áö¼Ó °¡´É¼º °£ÀÇ ±ÕÇüÀ» ¼³°èÇÏ¿© Çõ½Å°ú ºñÁî´Ï½º ¿µÇâÀÌ ¸¸³ª´Â »ý»ê ȯ°æ¿¡¼­ »ý¼ºÇü AIÀÇ ÀáÀç·ÂÀ» ½ÇÇöÇÏ´Â °ÍÀÔ´Ï´Ù."¶ó°í IDCÀÇ AI ¼ÒÇÁÆ®¿þ¾î ¸®¼­Ä¡ µð·ºÅÍ Kathy Lange´Â ¸»Çß½À´Ï´Ù.

À̱×Á¦Å¥Æ¼ºê ½º³À¼ô

»óȲ °³¿ä

Å×Å©³î·¯Áö ±¸ÀÔÀÚ¿¡ ´ëÇÑ ¾îµå¹ÙÀ̽º

Âü°í ÀÚ·á

KSA
¿µ¹® ¸ñÂ÷

¿µ¹®¸ñÂ÷

The IDC Perspective explores the challenges and innovations in scaling generative AI (GenAI) inference workloads in production, emphasizing cost reduction, latency improvement, and scalability. It highlights techniques like model compression, batching, caching, and parallelization to optimize inference performance. Vendors such as AWS, DeepSeek, Google, IBM, Microsoft, NVIDIA, Red Hat, Snowflake, and WRITER are driving advancements to enhance GenAI inference efficiency and sustainability. The document advises organizations to align inference strategies with use cases, regularly review costs, and partner with experts to ensure reliable, scalable AI deployment."Optimizing AI inference isn't just about speed," says Kathy Lange, research director, AI Software, IDC. "It's about engineering the trade-offs between cost, scalability, and sustainability to unlock the potential of generative AI in production, where innovation meets business impact."

Executive Snapshot

Situation Overview

Advice for the Technology Buyer

Learn More

(ÁÖ)±Û·Î¹úÀÎÆ÷¸ÞÀÌ¼Ç 02-2025-2992 kr-info@giikorea.co.kr
¨Ï Copyright Global Information, Inc. All rights reserved.
PC¹öÀü º¸±â