AI & Robotics Research Daily Digest

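Importance criteria and sources

Updates

Collection starts daily at around 08:00 KST, and the goal is to publish new research items to the public page by around 09:00 KST.

Display scope

By default, only items from the last 14 days are shown. Undated items are included only when they come from a daily trending feed such as Hugging Face, GitHub, or Papers with Code.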
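The display rule above can be sketched as a small filter. This is a minimal illustration, not the digest's actual implementation: the function name `is_displayed`, the `TRENDING_SOURCES` set, and the `papers_with_code` identifier are hypothetical, and rejecting future-dated items is an added assumption.

```python
from datetime import date
from typing import Optional

# Hypothetical source identifiers for the daily trending feeds
# (Hugging Face, GitHub, Papers with Code); names are assumptions.
TRENDING_SOURCES = {"hf_papers", "hf_models", "github", "papers_with_code"}

# Display window stated in the digest: the last 14 days.
WINDOW_DAYS = 14


def is_displayed(published: Optional[date], source: str, today: date) -> bool:
    """Return True if an item falls inside the display scope."""
    if published is None:
        # Undated items are kept only when they come from a trending feed.
        return source in TRENDING_SOURCES
    age_days = (today - published).days
    # Assumption: items dated in the future are not shown.
    return 0 <= age_days <= WINDOW_DAYS
```

For example, with `today = date(2026, 7, 1)`, an `ieee_spectrum` item dated 2026-06-24 is seven days old and passes, while an undated item from the same source is dropped.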

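Importance

★★★★★ Paradigm shifts, major SOTA advances, large open-source model releases

★★★★ Meaningful technical innovations, announcements from major labs, strong papers with released code

★★★ Useful improvements, tools, datasets, benchmarks, practical applications

★★ Niche results that are still worth noting within their field

★ Relevant items with limited research impact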
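Each entry in the listing shows its rating as five glyphs, for example `★★★★☆`. A minimal sketch of converting between a 1 to 5 score and that string is shown here; the helper names `render_stars` and `parse_stars` are hypothetical and not taken from the digest itself.

```python
FILLED = "★"
EMPTY = "☆"
MAX_STARS = 5


def render_stars(score: int) -> str:
    """Render a 1-5 importance score as filled stars padded with empty ones."""
    if not 1 <= score <= MAX_STARS:
        raise ValueError(f"score must be between 1 and {MAX_STARS}")
    return FILLED * score + EMPTY * (MAX_STARS - score)


def parse_stars(text: str) -> int:
    """Recover the numeric score by counting filled stars in a rating string."""
    score = text.count(FILLED)
    if not 1 <= score <= MAX_STARS:
        raise ValueError(f"cannot parse rating: {text!r}")
    return score
```

Keeping the score as an integer and rendering it only at display time makes it simple to sort or filter items by importance.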

Sources

🧠 AI / ML

arXiv AI, arXiv Machine Learning, arXiv Computation and Language, arXiv Computer Vision, r/MachineLearning, Hugging Face Papers

🤖 Physical AI & Robotics

arXiv Robotics, arXiv Systems and Control, r/robotics, The Robot Report, Physical Intelligence, Generalist AI, Figure AI, Skild AI, 1X, Agility Robotics, Apptronik, Sanctuary AI, Unitree, NEURA Robotics, Genesis AI, Google DeepMind Robotics, NVIDIA Robotics, Hugging Face LeRobot, Toyota Research Institute, Boston Dynamics

🎯 RL / Control

r/reinforcementlearning

🔧 Models & Tools

Papers with Code, Hugging Face Models, GitHub

🏢 Industry & Analysis

OpenAI Blog, Import AI, IEEE Spectrum, The Gradient, Alignment Forum

🔥 Community

Hacker News, r/computervision

🧠 AI / ML

21 items

A Hippocampus for Linear Attention: Exact Memory for What the Recurrent State Forgets

arxiv_ai 2026-07-01 ★★★★☆

An architecture that adds an exact explicit memory to linear attention, recovering information that the recurrent state loses.

Morphing into Hybrid Attention Models

hf_papers 2026-06-28 ▲ 37 ★★★★☆

A method that converts models to hybrid attention, keeping only some full-attention layers, to improve long-context efficiency.

Seed2.0 Model Card: Towards Intelligence Frontier for Real-World Complexity

hf_papers 2026-06-29 ▲ 23 ★★★★☆

Release of the Seed2.0 model series, which applies a new training approach aimed at handling real-world complexity.

ASPIRE: Agentic Skills Discovery for Robotics

hf_papers 2026-06-29 ▲ 18 ★★★★☆

An agentic skill discovery framework applied to robotics that autonomously discovers and composes robot skills.

HERMES: A Multi-Granularity Labeling Substrate for Pre-training Data Mixtures

arxiv_lg 2026-07-01 ★★★☆☆

A framework that optimizes pre-training data mixtures for foundation models using a multi-granularity labeling scheme.

DecompRL: Solving Harder Problems by Learning Modular Code Generation

arxiv_lg 2026-07-01 ★★★☆☆

A method that learns modular code generation with reinforcement learning (RL) to solve harder programming problems.

Bayesian Sparse Low-Rank Adaptation for LLM Uncertainty Estimation

arxiv_lg 2026-07-01 ★★★☆☆

A technique that uses sparse Bayesian LoRA to efficiently estimate the predictive uncertainty of large language models.

Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation

arxiv_lg 2026-07-01 ★★★☆☆

Improves annotation-free LLM self-distillation by selecting training data based on neuron activations.

Optimizing Visual Generative Models via Distribution-wise Rewards

arxiv_lg 2026-07-01 ★★★☆☆

A technique that optimizes image generation models with distribution-wise rewards rather than per-sample rewards.

ReContext: Recursive Evidence Replay as LLM Harness for Long-Context Reasoning

arxiv_ai 2026-07-01 ★★★☆☆

A harness that supports long-context LLM reasoning through recursive evidence replay.

G-RRM: Guiding Symbolic Solvers with Recurrent Reasoning Models

arxiv_ai 2026-07-01 ★★★☆☆

A neuro-symbolic method that guides symbolic solvers with a Recurrent Reasoning Model to improve math problem solving.

Fast Multi-dimensional Refusal Subspaces via RFM-AGOP

arxiv_ai 2026-07-01 ★★★☆☆

A mechanistic interpretability study that uses RFM-AGOP to quickly identify a model's multi-dimensional refusal subspaces.

What LLM Agents Say When No One Is Watching: Social Structure and Latent Objective Emergence

arxiv_ai 2026-07-01 ★★★☆☆

Analyzes the social structures and latent objectives that emerge among multiple LLM agents interacting without supervision.

Program-as-Weights: A Programming Paradigm for Fuzzy Functions

hf_papers 2026-07-01 ▲ 69 ★★★☆☆

A new programming paradigm that treats 'fuzzy' functions, which are hard to express as rules, as learnable weights.

WorldDirector: Controllable Video World Model with Persistent Dynamic Memory

hf_papers 2026-07-01 ▲ 20 ★★★☆☆

A controllable video world model that supports persistent dynamic memory and free-viewpoint exploration.

Multi-Resolution Flow Matching: Training-Free Diffusion Acceleration

hf_papers 2026-07-01 ▲ 26 ★★★☆☆

Multi-resolution flow matching that accelerates text-to-image diffusion models without training and independent of hardware.

EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive Environments

hf_papers 2026-07-01 ▲ 41 ★★★☆☆

A benchmark that evaluates the autonomous policy evolution of AI in interactive environments.

DemoPSD: Disagreement-Modulated Policy Self-Distillation

arxiv_lg 2026-07-01 ★★☆☆☆

A training method that improves generalization through policy self-distillation modulated by a disagreement signal.

kNNGuard: LLM Hidden Activations as Training-Free Configurable Guardrail

arxiv_lg 2026-07-01 ★★☆☆☆

Builds a configurable LLM safety guardrail from hidden activations, with no retraining required.

Online Safety Monitoring for LLMs

arxiv_ai 2026-07-01 ★★☆☆☆

An online safety monitoring method that watches deployed LLMs in real time.

Steerability via constraints: a substrate for scalable oversight of coding agents

arxiv_ai 2026-07-01 ★★☆☆☆

A framework for scalable oversight of coding agents through constraint-based steering.

🤖 Physical AI & Robotics

9 items

VT-WAM: Visual-Tactile World Action Model for Contact-Rich Manipulation

arxiv_ro 2026-07-01 ★★★★☆

Integrates visual and tactile sensing into a single World Action Model to improve contact-rich robot manipulation.

Learning to Move Before Learning to Do: Task-Agnostic Pretraining for VLAs

arxiv_ro 2026-07-01 ★★★★☆

Proposes a pretraining technique that first performs task-agnostic 'movement' pretraining to improve the downstream manipulation performance of VLAs.

WorldSample: Closed-loop Real-robot RL with World Modelling

arxiv_ro 2026-07-01 ★★★★☆

A framework that runs closed-loop reinforcement learning (RL) on a real robot with a learned world model to improve sample efficiency.

Actuator Reality Shaping for Zero-Shot Sim-to-Real Robot Learning

arxiv_ro 2026-07-01 ★★★★☆

A sim-to-real technique that aligns actuator dynamics so that simulation-trained policies transfer zero-shot to physical robots without fine-tuning.

Guided Action Flow: Q-Guided Inference for Flow-Matching VLAs

arxiv_ro 2026-07-01 ★★★☆☆

An inference technique that improves action selection in flow-matching VLA policies through Q-guided inference.

HEFT: Heavy-Payload Full-size Humanoid Teleoperation with Privileged Motion Guidance

arxiv_ro 2026-07-01 ★★★☆☆

A teleoperation system that uses privileged motion guidance to let a full-size humanoid handle heavy payloads.

Learning Agile Intruder Interception using Differentiable Quadrotor Dynamics

arxiv_ro 2026-07-01 ★★★☆☆

Learns an agile drone flight policy for intercepting moving targets using differentiable quadrotor dynamics.

F.03 Arrives at BMW

figure_ai 2026-06-30 ★★★☆☆

A deployment case in which the Figure 03 humanoid has begun real operation at a BMW manufacturing site.

Welcome to Robot Park: Apptronik's Apollo Goes to Work Training Humanoid Intelligence

apptronik 2026-06-30 ★★★☆☆

Unveils a 90,000 sq ft data collection and training facility where Apollo humanoids gather data and train together with Google DeepMind.

🔧 Models & Tools

9 items

zai-org/GLM-5.2

hf_models ▲ 596 ★★★★☆

Release of GLM-5.2, a new flagship open-weight large language model.

deepseek-ai/DeepSeek-V4-Pro-DSpark

hf_models ▲ 225 ★★★★☆

Release of DeepSeek-V4-Pro-DSpark, a new open text generation model from DeepSeek.

google/tabfm-1.0.0-pytorch

hf_models ▲ 189 ★★★☆☆

PyTorch release of tabfm-1.0.0, Google's tabular foundation model for classification on tabular data.

nvidia/LocateAnything-3B

hf_models ▲ 172 ★★★☆☆

Release of LocateAnything-3B, NVIDIA's 3B multimodal model for grounding and localization that finds arbitrary objects in an image from instructions.

baidu/Unlimited-OCR

hf_models ▲ 512 ★★★☆☆

Release of Unlimited-OCR, Baidu's OCR model based on image-to-text conversion.

deepreinforce-ai/Ornith-1.0-35B

hf_models ▲ 371 ★★★☆☆

Release of Ornith-1.0-35B, a 35B open text generation model trained primarily with reinforcement learning (RL).

InternScience/Agents-A1

hf_models ▲ 228 ★★★☆☆

Release of Agents-A1, an agent model that reaches trillion-parameter-class performance at 35B scale through long-horizon trajectory scaling.

harvard-edge/cs249r_book

github ▲ 26544 ★★★☆☆

Harvard's open textbook on the foundations of machine learning systems (ML Systems), useful for understanding training and deployment systems.

alibaba/page-agent

github ▲ 23073 ★★☆☆☆

An open-source in-page GUI agent (page-agent) that controls web interfaces with natural language.

🏢 Industry & Analysis

3 items

AI Is Designing Radio Chips That Humans Couldn't Even Imagine

ieee_spectrum 2026-06-24 ★★★☆☆

A case in which reinforcement learning (RL) was used to design RFIC radio chips for 5G and autonomous driving that outperform human designs.

The Lab Mistake That Might Revolutionize Computing

ieee_spectrum 2026-06-29 ★★☆☆☆

An accidental discovery that implemented an artificial neuron on a silicon chip, raising the possibility of new computing architectures.

Import AI 463: Self-improving robots; a 10k Chinese GPU cluster; and an elegiac essay for the human era

import_ai 2026-06-29 ★★☆☆☆

An AI research newsletter covering self-improving robots, a 10,000-GPU Chinese cluster, and more.
