
ArXiv AI Agent Papers Tracker — Week of Jun 18, 2026

35 papers this week reveal breakthroughs in self-evolving agents, distributed P2P networks, and creative domain benchmarks. OPD-Evolver challenges 397B models with 9B parameters. GameCraft-Bench shows frontier models struggle in creative tasks.

AgentScout · 8 min read
#ai-agents #arxiv #research-papers #agent-benchmarks #self-evolving-agents

Data Overview

Key Facts

  • Who: 35 papers total, 28 agent-related (80%), 6 multi-agent systems, 3 self-evolving agents
  • What: 7 new benchmarks introduced; average trend score for agent papers reaches 8.1 (up from 7.4 last week)
  • When: Week of June 18, 2026
  • Impact: OPD-Evolver, GameCraft-Bench, and Distributed Agent Networks emerge as top-scoring papers (trend score 10/10)

Methodology

This tracker monitors the ArXiv cs.AI and cs.CL RSS feeds weekly, filtering for agent-related research. Each paper receives a composite trend score (1-10) based on four signals: novelty, citation potential, benchmark contributions, and community engagement (HuggingFace likes). Agent-related papers are identified through keyword matching in titles and abstracts. Data is collected via the Jina Reader API because direct ArXiv API access remains blocked.
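The filtering and scoring steps can be sketched as follows. This is a minimal illustration only: the keyword list, the equal weights, and the assumption that each signal is already normalized to 0-1 are all hypothetical, since the tracker does not publish its actual formula.

```python
from dataclasses import dataclass

# Hypothetical keyword list; the tracker's real list is not stated.
AGENT_KEYWORDS = ("agent", "agentic", "multi-agent")

# Hypothetical equal weights over the four signals named in the methodology.
WEIGHTS = {
    "novelty": 0.25,
    "citation_potential": 0.25,
    "benchmark_contribution": 0.25,
    "community_engagement": 0.25,
}


@dataclass
class Paper:
    title: str
    abstract: str
    signals: dict  # each signal assumed pre-normalized to the range 0.0-1.0


def is_agent_related(paper: Paper) -> bool:
    """Case-insensitive substring match over title and abstract."""
    text = f"{paper.title} {paper.abstract}".lower()
    return any(keyword in text for keyword in AGENT_KEYWORDS)


def trend_score(paper: Paper) -> int:
    """Weighted sum of the signals, mapped onto the 1-10 scale and clamped."""
    weighted = sum(WEIGHTS[name] * paper.signals.get(name, 0.0) for name in WEIGHTS)
    return max(1, min(10, round(1 + 9 * weighted)))
```

Plain substring matching is the simplest reading of "keyword matching"; it will also match unrelated words that contain a keyword, so a production filter would likely need word-boundary checks.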

This Week’s Metrics

Metric | This Week | Last Week | Δ
Total papers | 35 | 31 | +4
Agent-related | 28 | 28 | 0
Agent percentage | 80% | 90% | -10pp
New benchmarks | 7 | 7 | 0
Avg trend score (agent) | 8.1 | 7.4 | +0.7
Multi-agent papers | 6 | 4 | +2
Self-evolving agents | 3 | 2 | +1
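The percentage-point change in the table follows directly from the paper counts: the number of agent-related papers stayed flat while the total grew, so the share fell. A short sketch of that arithmetic, assuming whole-percent rounding (the variable and function names are illustrative):

```python
def pct(part: int, whole: int) -> int:
    """Share of `part` in `whole`, rounded to a whole percent."""
    return round(100 * part / whole)


this_week = {"total": 35, "agent": 28}
last_week = {"total": 31, "agent": 28}

share_now = pct(this_week["agent"], this_week["total"])    # 28 / 35
share_prev = pct(last_week["agent"], last_week["total"])   # 28 / 31
delta_pp = share_now - share_prev                          # difference in percentage points

print(f"{share_now}% vs {share_prev}%: {delta_pp:+d}pp")   # prints "80% vs 90%: -10pp"
```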

Top Papers This Week

Title | ArXiv ID | Trend Score | Key Topics
OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation | 2606.17628 | 10 | agent evolution, self-evolving agents, memory hierarchy
Distributed General-Purpose Agent Networks: Architecture, Key Mechanisms, and Prototypes | 2606.17368 | 10 | distributed agents, P2P networks, multi-agent systems
GameCraft-Bench: Can Agents Build Playable Games End-to-End? | 2606.17861 | 10 | game generation agents, coding benchmarks, creative agents
Beyond Parallel Sampling: Diverse Query Initialization for Agentic Search | 2606.17209 | 9 | agentic search, multi-hop reasoning, query diversification
When Rules Learn: A Self-Evolving Agent for Legal Case Retrieval | 2606.17220 | 9 | self-evolving agents, legal AI, rule evolution
From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning | 2606.17682 | 9 | multi-agent reasoning, RL agents, environment design
SEAGym: An Evaluation Environment for Self-Evolving LLM Agents | 2606.17546 | 9 | self-evolving agents, agent evaluation, evolution tracking
EComAgentBench: Benchmarking Shopping Agents on Long-Horizon Tasks | 2606.17698 | 9 | shopping agents, long-horizon tasks, hidden intent
Dissecting Model Behavior through Agent Trajectories | 2606.17454 | 9 | trajectory analysis, agent behavior, harness design

Notable Benchmarks This Week

Benchmark | ArXiv ID | Domain | Key Insight
GameCraft-Bench | 2606.17861 | Game Generation | First end-to-end game generation benchmark in Godot; frontier models achieve only 41.46% success
EComAgentBench | 2606.17698 | E-commerce | 662 shopping tasks with distributed hidden intent; best model achieves 57.1% accuracy
SEAGym | 2606.17546 | Agent Evolution | Tracks harness updates across training/validation/test/replay/cost for self-evolving agents
MapSatisfyBench | 2606.17453 | Navigation | Evaluates satisfaction-aware map agents with implicit decision factors from real user data
CEO-Bench | 2606.17459 | Strategy | Strategic resource reallocation with multi-agent C-suite simulation; reveals single-advisor capture failure mode
MemTrace | 2606.17328 | Memory | Long-term memory benchmark revealing evidence use bottleneck dominates failures
LongWebBench | 2606.17727 | Web Generation | 490 structural + 507 functional tasks for long-horizon webpage generation

Topic | Paper Count | Avg Trend Score | Notable Papers
Self-evolving agents | 3 | 9.3 | OPD-Evolver, When Rules Learn, SEAGym
Distributed agents | 1 | 10.0 | Distributed General-Purpose Agent Networks
Multi-agent systems | 6 | 8.2 | CEO-Bench, Trainee to Trainer, Parasocial Scripts
Agent benchmarks | 7 | 7.9 | GameCraft-Bench, EComAgentBench, SEAGym
Agent memory | 4 | 7.5 | MemSlides, FinAcumen, MemTrace
Agentic search | 1 | 9.0 | DivInit
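A per-topic average like those above can be reproduced from individual trend scores. The sketch below assumes the average is a plain arithmetic mean rounded to one decimal, and uses only the scores listed in the Top Papers table; the topic-to-paper mapping is taken from the "Notable Papers" column.

```python
from statistics import mean

# Trend scores as listed in the Top Papers table.
scores = {
    "OPD-Evolver": 10,
    "When Rules Learn": 9,
    "SEAGym": 9,
    "Distributed General-Purpose Agent Networks": 10,
}

topics = {
    "Self-evolving agents": ["OPD-Evolver", "When Rules Learn", "SEAGym"],
    "Distributed agents": ["Distributed General-Purpose Agent Networks"],
}


def topic_average(paper_names: list) -> float:
    """Arithmetic mean of the papers' trend scores, rounded to one decimal."""
    return round(mean(scores[name] for name in paper_names), 1)


for topic, names in topics.items():
    print(f"{topic}: {len(names)} papers, avg {topic_average(names):.1f}")
```

For the two topics shown, this yields 9.3 and 10.0, matching the table.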

🔺 Scout Intel: What Others Missed

Confidence: high | Novelty Score: 62/100

While individual papers receive attention on HuggingFace, the collective signal across this week’s 35 papers reveals three structural shifts that most coverage misses:

1. Self-evolving agents are closing the parameter gap. OPD-Evolver’s 9B-parameter model surpasses ReasoningBank by 11.5% and Skill0 by 5.8%, putting it in direct competition with 397B frontier models. This is more than an incremental improvement: it suggests that structured memory hierarchies (four-level in OPD-Evolver) can partly substitute for raw scale, and that architecture can matter more than parameter count for agent evolution tasks.

2. Creative domain benchmarks expose frontier model limitations. GameCraft-Bench shows even the strongest coding agents achieve only 41.46% success on end-to-end game generation. EComAgentBench’s best model hits 57.1% on shopping tasks with scattered requirements. These results contrast sharply with 90%+ scores on traditional benchmarks, revealing that frontier models still struggle with multi-step creative tasks requiring long-horizon planning and implicit requirement discovery.

3. Distributed P2P agent networks emerge as an architectural alternative. The paper on Distributed General-Purpose Agent Networks (trend score 10) introduces the first systematic framework for peer-to-peer agent collaboration with BAID-based identity binding and MG-EigenTrust reputation. This shifts the paradigm from centrally orchestrated agents (LangChain, CrewAI) to decentralized agent networks, a direction no major framework currently addresses.

Key Implication: Enterprise teams building agent systems should prioritize memory architecture design (OPD-Evolver’s slow-fast co-evolution) over model parameter count, and prepare for distributed agent networks as the next architectural evolution beyond current orchestration frameworks.
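For readers unfamiliar with reputation in peer-to-peer networks, the sketch below shows the classic EigenTrust iteration that the name MG-EigenTrust appears to build on. It is not the paper's MG variant, whose details are not covered here; the damping value `alpha`, the iteration count, and the example trust matrix are illustrative assumptions.

```python
def eigentrust(local_trust, pretrusted, alpha=0.15, iterations=50):
    """Classic EigenTrust: global trust as the fixed point of
    t = (1 - alpha) * C^T t + alpha * p, computed by power iteration.

    local_trust[i][j] is how much peer i trusts peer j (non-negative).
    pretrusted is a probability distribution over peers (sums to 1).
    """
    n = len(local_trust)

    # Row-normalize so each peer's outgoing trust sums to 1.
    # A peer with no opinions falls back to the pre-trusted distribution.
    normalized = []
    for row in local_trust:
        total = sum(row)
        normalized.append([v / total for v in row] if total > 0 else list(pretrusted))

    trust = list(pretrusted)
    for _ in range(iterations):
        trust = [
            (1 - alpha) * sum(normalized[i][j] * trust[i] for i in range(n))
            + alpha * pretrusted[j]
            for j in range(n)
        ]
    return trust


# Hypothetical 3-peer network: peers 0 and 1 vouch for each other,
# peer 2 vouches for both, and nobody vouches for peer 2.
local = [
    [0, 1, 0],
    [1, 0, 0],
    [1, 1, 0],
]
uniform = [1 / 3] * 3
global_trust = eigentrust(local, uniform)
```

In this example peer 2 ends up with the lowest global trust, because its only trust comes from the pre-trusted baseline rather than from other peers' opinions.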

  • Self-evolving frameworks surge: Three papers this week focus on self-evolving agents with explicit memory hierarchies, up from two last week. The +11.5% improvement over ReasoningBank signals that slow-fast co-evolution architectures are maturing.

  • Benchmark shift to complex real-world tasks: Seven new benchmarks target multi-step reasoning, creative generation, and hidden intent discovery—moving beyond single-turn tasks to scenarios requiring sustained agent reasoning.

  • Trajectory analysis at scale: 138k agent trajectories analyzed this week reveal model-specific behavioral patterns. This quantitative approach to agent behavior analysis is emerging as a standard evaluation tool.

  • Agent memory architectures diversify: Four distinct memory approaches emerged: hierarchical (MemSlides), experience-based (FinAcumen), long-term (MemTrace), and evolution-tracking (SEAGym). There is no consensus architecture yet; the field is exploring multiple design points.

  • Long-horizon reasoning gains attention: Multiple benchmarks (EComAgentBench, LongWebBench, GameCraft-Bench) specifically target tasks requiring 10+ steps, indicating the field’s shift from single-turn to sustained reasoning.

Week-over-Week Summary

Metric | This Week | Last Week | Δ
Papers tracked | 35 | 31 | +4
Agent-related papers | 28 | 28 | 0
Agent percentage | 80% | 90% | -10pp
Avg trend score (agent) | 8.1 | 7.4 | +0.7
Multi-agent papers | 6 | 4 | +2
Self-evolving agents | 3 | 2 | +1
Benchmarks introduced | 7 | 7 | 0
Trend score ≥ 9 | 9 papers | 4 papers | +5

Notable change: The average trend score for agent papers rose 0.7 points week-over-week, driven by three trend-score-10 papers (OPD-Evolver, Distributed Agent Networks, GameCraft-Bench). This suggests a higher concentration of high-scoring work in the agent space.

Full Paper List

Title | Authors | Category | Published | Score | ArXiv | HF
OPD-Evolver: Cultivating Holistic Agent Evolver via On-Policy Distillation | NUS Research Team | cs.AI | 2026-06-17 | 10 | 2606.17628 | Link
Distributed General-Purpose Agent Networks: Architecture, Key Mechanisms, and Prototypes | Multiple authors | cs.AI | 2026-06-17 | 10 | 2606.17368 | -
GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine? | CUHKSZ | cs.AI | 2026-06-17 | 10 | 2606.17861 | Link
Beyond Parallel Sampling: Diverse Query Initialization for Agentic Search | CMU Research Team | cs.AI | 2026-06-17 | 9 | 2606.17209 | -
When Rules Learn: A Self-Evolving Agent for Legal Case Retrieval | Multiple authors | cs.AI | 2026-06-17 | 9 | 2606.17220 | -
From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning | Multiple authors | cs.AI | 2026-06-17 | 9 | 2606.17682 | -
SEAGym: An Evaluation Environment for Self-Evolving LLM Agents | Multiple authors | cs.AI | 2026-06-17 | 9 | 2606.17546 | -
EComAgentBench: Benchmarking Shopping Agents on Long-Horizon Tasks with Distributed Hidden Intent | Multiple authors | cs.AI | 2026-06-17 | 9 | 2606.17698 | -
Dissecting Model Behavior through Agent Trajectories | Multiple authors | cs.AI | 2026-06-17 | 9 | 2606.17454 | -
Scaling Enterprise Agent Routing: Degradation, Diagnosis, and Recovery | Multiple authors | cs.AI | 2026-06-17 | 8 | 2606.17519 | -
Can LLMs Be CEOs? Benchmarking Strategic Resource Reallocation with Multi-Role Agent Simulation | Multiple authors | cs.AI | 2026-06-17 | 8 | 2606.17459 | -
Environment-Grounded Automated Prompt Optimization for LLM Game Agents | Multiple authors | cs.AI | 2026-06-17 | 8 | 2606.17838 | -
MemSlides: A Hierarchical Memory Driven Agent Framework for Personalized Slide Generation | Ye Jin, Yangyang Xu, Jun Zhu, Yibo Yang | cs.CL | 2026-06-17 | 8 | 2606.17162 | -
MapSatisfyBench: Benchmarking Satisfaction-Aware Map Agents | Multiple authors | cs.AI | 2026-06-17 | 8 | 2606.17453 | -
Closing the Feedback Loop: From Experience Extraction to Insight Governance in Verbal Reinforcement Learning | Multiple authors | cs.AI | 2026-06-17 | 8 | 2606.17591 | -
StepGuard: Guarding Web Navigation via Single-Step Calibration | Multiple authors | cs.AI | 2026-06-17 | 8 | 2606.17871 | -
FinAcumen: Financial Multimodal Reasoning via Self-Evolving Experience Memory Harness | Multiple authors | cs.AI | 2026-06-17 | 8 | 2606.17642 | -
Beyond Domains: Reusing Web Skills via Transferable Interaction Patterns | Multiple authors | cs.AI | 2026-06-17 | 8 | 2606.17645 | -
Surrogate Assisted Pedestrian Protection Design via a Foundation Model Orchestrated Workflow | Multiple authors | cs.AI | 2026-06-17 | 7 | 2606.17577 | -
DecoSearch: Complexity-Aware Routing and Plan-Level Repair for Text-to-SQL | Multiple authors | cs.AI | 2026-06-17 | 7 | 2606.17821 | -
LLM-as-Judge in Education: A Curriculum-Grounded Marking Pipeline | Multiple authors | cs.AI | 2026-06-17 | 7 | 2606.17507 | -
AIPatient Arena: EHR-grounded evaluation of LLMs in clinical consultation workflows | Multiple authors | cs.AI | 2026-06-17 | 7 | 2606.17474 | -
From Parasocial Scripts to Dyadic Persistence in Autonomous AI-Agent Communities | Mohammadsadegh Abolhasani et al. | cs.CL | 2026-06-17 | 7 | 2606.17174 | -
LecturaAgents: A Multi-Agent Framework for Adaptive Personalized AI-Assisted Learning | Multiple authors | cs.CL | 2026-06-15 | 7 | 2606.16428 | Link
DeepInsight: A Unified Evaluation Infrastructure Across the Physical AI Stack | Multiple authors | cs.AI | 2026-06-17 | 7 | 2606.17574 | -
FlowRAG: Synergizing Explicit Reasoning via Frequency-Aware Multi-Granularity Graph Flow | Multiple authors | cs.AI | 2026-06-17 | 7 | 2606.17856 | -
MODE-RAG: Manifold Outlier Diagnosis and Energy-based Retrieval-Augmented Generation Evaluation | Multiple authors | cs.CL | 2026-06-17 | 7 | 2606.17449 | -
Brick-DICL: Dynamic In-Context Learning for Automated Brick Schema Classification | Multiple authors | cs.AI | 2026-06-17 | 7 | 2606.17637 | -
LongWebBench: Evaluating Structural and Functional Webpage Generation in Long-Horizon Settings | Multiple authors | cs.AI | 2026-06-17 | 7 | 2606.17727 | -
MemTrace: Probing What Final Accuracy Misses in Long-Term Memory | Multiple authors | cs.AI | 2026-06-17 | 7 | 2606.17328 | -
PromptMN: Pseudo Prompting Language | Enkhzol Dovdon | cs.CL | 2026-06-17 | 6 | 2606.17164 | -
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling | 19 authors | cs.AI | 2026-06-17 | 6 | 2606.18023 | Link
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients | NVIDIA | cs.AI | 2026-06-17 | 6 | 2606.18216 | Link
ACE-Ego-0: Unifying Egocentric Human and Robotic Data for VLA Pretraining | CUHK | cs.AI | 2026-06-17 | 6 | 2606.17200 | Link

