Lawrence Jang

Contact: ljang [at] cs [dot] cmu [dot] edu

Lawrence.jpeg

I am a first-year Machine Learning PhD student at Carnegie Mellon University advised by Ruslan Salakhutdinov. I am interested in computer-use agents, real-time personal assistants, and the future of human-computer interaction through the lens of personal superintelligence.

My recent work introduces benchmarks for phone, computer-use, and web agents — including iOSWorld, MyPCBench, and Odysseys.

selected publications

  1. iosworld.png
    iOSWorld: A Benchmark for Personally Intelligent Phone Agents
    Lawrence Keunho Jang, Mareks Woodside, Geronimo Carom , and 3 more authors
    In ArXiv Preprint , 2026
  2. mypcbench.png
    MyPCBench: A Benchmark for Personally Intelligent Computer-Use Agents
    Lawrence Keunho Jang, Andrew Keunwoo Jang, Jing Yu Koh , and 1 more author
    In ArXiv Preprint , 2026
  3. odysseys.png
    Odysseys: Benchmarking Web Agents on Realistic Long Horizon Tasks
    Lawrence Keunho Jang, Jing Yu Koh, Daniel Fried , and 1 more author
    In ArXiv Preprint , 2026
  4. tac.png
    TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
    Frank F. Xu, Yufan Song, Boxuan Li , and 18 more authors
    In NeurIPS , 2025
  5. bgym.png
    The BrowserGym Ecosystem for Web Agent Research
    Thibault Le Sellier De Chezelles, Maxime Gasse, Alexandre Drouin , and 17 more authors
    In TMLR , 2025
  6. headerFINAL.png
    VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks
    Lawrence Jang, Yinheng Li, Charles Ding , and 5 more authors
    In ICLR , 2025
  7. waa.png
    Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale
    Rogerio Bonatti, Dan Zhao, Francesco Bonacci , and 9 more authors
    In ICML , 2025
  8. ical.png
    ICAL: Continual Learning of Multimodal Agents by Transforming Trajectories into Actionable Insights
    Gabriel Sarch, Lawrence Jang, Michael Tarr , and 3 more authors
    In NeurIPS Spotlight , 2024
  9. vwaNEW.png
    VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks
    Jing Yu Koh, Robert Lo, Lawrence Jang , and 7 more authors
    In ACL , 2024