OpenNovelty: An Open-domain Benchmark for Evaluating the Open-ended Novelty of Language Models
OpenNovelty studies whether language models can judge the novelty of open-ended ideas rather than only solve fixed-answer tasks. It introduces an open-domain benchmark for comparing LLM judgments of novelty ...