AI2 Adapt Dev

community

AI & ML interests

Open science can (maybe) save the world

Recent Activity

akariasai authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

faezeb authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

pradeepd authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

View all activity

akariasai

authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 11 days ago • 54

faezeb

authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 11 days ago • 54

pradeepd

authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 11 days ago • 54

shannons

authored 2 papers 11 days ago

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature

Paper • 2406.07835 • Published Jun 10, 2024 • 1

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Paper • 2510.09541 • Published Oct 10 • 14

hamishivi

authored 2 papers 11 days ago

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Paper • 2511.07317 • Published 26 days ago • 13

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 11 days ago • 54

shannons

authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 11 days ago • 54

ljvmiranda921

authored a paper about 1 month ago

FilBench: Can LLMs Understand and Generate Filipino?

Paper • 2508.03523 • Published Aug 5

DongfuJiang

authored 2 papers about 2 months ago

Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning

Paper • 2509.22824 • Published Sep 26 • 20

VideoScore2: Think before You Score in Generative Video Evaluation

Paper • 2509.22799 • Published Sep 26 • 25

DongfuJiang

authored a paper 3 months ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1 • 75

valpy

authored 4 papers 5 months ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 22

IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance

Paper • 2502.08395 • Published Feb 12

RewardBench 2: Advancing Reward Model Evaluation

Paper • 2506.01937 • Published Jun 2 • 7

Generalizing Verifiable Instruction Following

Paper • 2507.02833 • Published Jul 3 • 1

saumyamalik

authored 3 papers 6 months ago

QuRating: Selecting High-Quality Data for Training Language Models

Paper • 2402.09739 • Published Feb 15, 2024 • 4

Lost in the Logic: An Evaluation of Large Language Models' Reasoning Capabilities on LSAT Logic Games

Paper • 2409.19012 • Published Sep 23, 2024

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 22

ljvmiranda921

authored a paper 6 months ago

R3: Robust Rubric-Agnostic Reward Models

Paper • 2505.13388 • Published May 19 • 11