Entities · Labs
ScienceCast
13 articles tagged with this entity.
-
AI-Driven Test Case Generation from Natural Language Requirements: A Survey of Techniques and Research Gaps
-
SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating
-
MalTree: Tracing Malware Evolution from Embeddings at Scale
-
Inside the Visual Mind: Neuroscience-Motivated Concept Circuits for Interpreting and Steering Vision Transformers
-
Characterize Then Distill: Mechanistic Reasoning in Large Output Spaces
-
Evidence-Based Intelligent Diagnostic and Therapeutic Visualization System with Large Language Models: Multi-Turn Interaction and Multimodal Treatment Plan Generation
-
LLM Agent-Assisted Reverse Engineering with Quantitative Readability Metrics
-
MacArena: Benchmarking Computer Use Agents on an Online macOS Environment
-
EASE-TTT: Evidence-Aligned Selective Test-Time Training for Long-Context Question Answering
-
Act As a Real Researcher: A Suite of Benchmarks Evaluating Frontier LLMs and Agentic Harnesses in Research Lifecycle
-
Agentic Large Language Models for Automated Structural Analysis of 3D Frame Systems
-
OpenHalDet: A Unified Benchmark for Hallucination Detection across Diverse Generation Scenarios
-
GP-Adapter: Gaussian Process CLIP-Adapter for Few-Shot Out-of-Distribution Detection