Skip to content
  • KOSPI 2664.14 -7.43 -0.28%
  • KOSDAQ 775.79 -3.39 -0.44%
  • KOSPI200 356.15 -0.29 -0.08%
  • USD/KRW 1329 -1.00 0.08%
View Market Snapshot
Artificial intelligence

NCSOFT launches evaluation model to verify performance of AI LLMs

With ‘VARCO Judge LLM’, companies creating AI-based services can quickly compare and evaluate the quality of various LLMs

By Sep 23, 2024 (Gmt+09:00)

1 Min read

NCSOFT launches evaluation model to verify performance of AI LLMs

NCSOFT Corp. announced on Monday that it has launched VARCO Judge LLM, the first evaluation model in South Korea to verify the performance and capabilities of artificial intelligence (AI) large language models (LLMs).
 
VARCO Judge LLM is an evaluation model that checks how quickly and accurately other LLMs perform tasks. 

With this model, companies creating AI-based services can quickly compare and evaluate the quality of various LLMs and adopt the best model for their services.

R&D companies can also verify the performance level of their LLMs to demonstrate performance advantages or quickly identify and strengthen weaknesses. 

Headquarters of NCSOFT in Pangyo
Headquarters of NCSOFT in Pangyo

NCSOFT explained that VARCO Judge LLM has the highest performance among models in the same class, and plans to use it to improve the quality of its own LLM 'VARCO'.

“In the rapidly evolving AI market, services that select and apply the optimal model for each industry are becoming increasingly important,” said Lee Yeon-su, head of NCSOFT's research division.

”VARCO Judge LLM will not only improve the quality of existing LLM-based services, but will also become an indispensable tool for the AI business,” she added.

Write to Seung-Woo Lee at leeswoo@hankyung.com
More to Read
Comment 0
0/300