Researchers say they’ve discovered a supply-chain attack flooding repositories with malicious packages that contain invisible ...
An evaluation suite for agentic models in real MCP tool environments (Notion / GitHub / Filesystem / Postgres / Playwright). MCPMark provides a reproducible, extensible benchmark for researchers and ...
Abstract: The rapid scaling of large language model (LLM) training and inference has accelerated their adoption in semiconductor design across academia and industry. Most prior works benchmark LLMs ...
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi… ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results