Artificial intelligence

NCSOFT launches evaluation model to verify performance of AI LLMs

With ‘VARCO Judge LLM’, companies creating AI-based services can quickly compare and evaluate the quality of various LLMs

By Seung-Woo Lee, Sep 23, 2024 (GMT+09:00)

NCSOFT Corp. announced on Monday that it has launched VARCO Judge LLM, the first evaluation model in South Korea to verify the performance and capabilities of artificial intelligence (AI) large language models (LLMs).

VARCO Judge LLM is an evaluation model that checks how quickly and accurately other LLMs perform tasks.

With this model, companies creating AI-based services can quickly compare and evaluate the quality of various LLMs and adopt the best model for their services.

R&D companies can also verify the performance level of their LLMs to demonstrate performance advantages or quickly identify and strengthen weaknesses.

NCSOFT said VARCO Judge LLM has the highest performance among models in its class and that the company plans to use it to improve the quality of its own LLM, VARCO.

“In the rapidly evolving AI market, services that select and apply the optimal model for each industry are becoming increasingly important,” said Lee Yeon-su, head of NCSOFT's research division.

“VARCO Judge LLM will not only improve the quality of existing LLM-based services, but will also become an indispensable tool for the AI business,” she added.

Write to Seung-Woo Lee at leeswoo@hankyung.com
