|
3 | 3 | <head> |
4 | 4 | <meta charset="utf-8"> |
5 | 5 | <!-- Meta tags for social media banners --> |
6 | | - <meta name="description" content="Large Language Model Psychometrics: A Comprehensive Survey of Evaluation, Validation, and Enhancement"> |
7 | | - <meta property="og:title" content="Large Language Model Psychometrics: A Comprehensive Survey of Evaluation, Validation, and Enhancement"/> |
8 | | - <meta property="og:description" content="A comprehensive survey on large language model evaluation methods"/> |
| 6 | + <meta name="description" content="Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement"> |
| 7 | + <meta property="og:title" content="Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement"/> |
| 8 | + <meta property="og:description" content="A systematic review on large language model evaluation methods"/> |
9 | 9 | <meta property="og:url" content="URL OF THE WEBSITE"/> |
10 | 10 | <meta property="og:image" content="static/images/overview_00.png" /> |
11 | 11 | <meta property="og:image:width" content="1200"/> |
12 | 12 | <meta property="og:image:height" content="630"/> |
13 | 13 |
|
14 | | - <meta name="twitter:title" content="Large Language Model Psychometrics: A Comprehensive Survey of Evaluation, Validation, and Enhancement"> |
15 | | - <meta name="twitter:description" content="A comprehensive survey on large language model evaluation methods"> |
| 14 | + <meta name="twitter:title" content="Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement"> |
| 15 | + <meta name="twitter:description" content="A systematic review on large language model evaluation methods"> |
16 | 16 | <meta name="twitter:image" content="static/images/logo-survey.png"> |
17 | 17 | <meta name="twitter:card" content="summary_large_image"> |
18 | 18 | <meta name="keywords" content="large language models, LLM, psychometrics, evaluation, validation, enhancement, survey"> |
19 | 19 | <meta name="viewport" content="width=device-width, initial-scale=1"> |
20 | 20 |
|
21 | 21 | <title>Large Language Model Psychometrics: |
22 | | - A Comprehensive Survey of |
| 22 | + A Systematic Review of |
23 | 23 | Evaluation, Validation, and Enhancement |
24 | 24 | </title> |
25 | 25 | <link rel="icon" type="image/x-icon" href="static/images/logo-survey.ico"> |
|
367 | 367 | <div class="container is-max-desktop"> |
368 | 368 | <div class="columns is-centered"> |
369 | 369 | <div class="column has-text-centered"> |
370 | | - <h1 class="title is-1 publication-title">Large Language Model Psychometrics: A Comprehensive Survey of Evaluation, Validation, and Enhancement</h1> |
| 370 | + <h1 class="title is-1 publication-title">Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement</h1> |
371 | 371 |
|
372 | 372 |
|
373 | 373 | <div class="is-size-5 publication-authors authors-section"> |
@@ -464,21 +464,6 @@ <h2 class="title is-3 has-text-centered">Abstract</h2> |
464 | 464 | <div class="abstract content"> |
465 | 465 | <p> |
466 | 466 | The survey will be released soon. Stay tuned! |
467 | | - <!-- The evolving capabilities of large language models (LLMs) have outpaced traditional evaluation methodologies and introduced novel evaluation challenges, |
468 | | - such as assessing human-like psychological constructs, addressing the limitations |
469 | | - of static and task-specific benchmarks, and meeting the requirement for human-centered |
470 | | - evaluation. These challenges intersect with psychometrics, the science |
471 | | - of quantifying intangible aspects of human psychology. This survey introduces |
472 | | - and synthesizes the emerging interdisciplinary field of LLM psychometrics, which |
473 | | - applies psychometric instruments, theories, and principles to evaluate, understand, |
474 | | - and enhance LLMs. We systematically review the role of psychometrics in shaping |
475 | | - LLM benchmarking principles, broadening evaluation scopes, refining evaluation |
476 | | - methodologies, validating assessment processes, and enhancing LLM capabilities. |
477 | | - The integration of diverse efforts and insights in this paper offers a structured |
478 | | - framework for researchers from various backgrounds, facilitating a comprehensive |
479 | | - understanding of this emerging field. We aim to offer insights into developing |
480 | | - future evaluation paradigms for human-level AI and advance human-centered AI |
481 | | - psychology for the greater common good. --> |
482 | 467 | </p> |
483 | 468 | </div> |
484 | 469 | </div> |
@@ -697,15 +682,15 @@ <h2 class="title is-3 has-text-centered">Citation</h2> |
697 | 682 | </button> |
698 | 683 | <pre> |
699 | 684 | @article{ye2025llmpsychometrics, |
700 | | - title={Large Language Model Psychometrics: A Comprehensive Survey of Evaluation, Validation, and Enhancement}, |
| 685 | + title={Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement}, |
701 | 686 | author={Ye, Haoran and Jin, Jing and Xie, Yuhang, and Zhang, Xin and Song, Guojie}, |
702 | 687 | year={2025}, |
703 | 688 | } |
704 | 689 | </pre> |
705 | 690 | <script> |
706 | 691 | function copyCitation() { |
707 | 692 | const citationText = `@article{ye2025llmpsychometrics, |
708 | | - title={Large Language Model Psychometrics: A Comprehensive Survey of Evaluation, Validation, and Enhancement}, |
| 693 | + title={Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement}, |
709 | 694 | author={Ye, Haoran and Jin, Jing and Xie, Yuhang, and Zhang, Xin and Song, Guojie}, |
710 | 695 | year={2025}, |
711 | 696 | }`; |
|
0 commit comments