    Repositories list

    • vivaria

      Public
      Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
      TypeScript
      MIT License
      227422615 Updated Jan 12, 2025
    • Public repository containing METR's DVC pipeline for plot generation
      Python
      0000 Updated Jan 11, 2025
    • [ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?
      Python
      MIT License
      392001 Updated Jan 11, 2025
    • Shell
      1070 Updated Jan 10, 2025
    • TeX
      Other
      37202 Updated Jan 9, 2025
    • Python
      Other
      65112 Updated Jan 9, 2025
    • METR Task Standard
      TypeScript
      MIT License
      3213563 Updated Jan 3, 2025
    • Dockerfile
      0000 Updated Dec 29, 2024
    • Python
      0000 Updated Dec 28, 2024
    • Python
      1032 Updated Dec 13, 2024
    • .github

      Public
      0000 Updated Nov 24, 2024
    • nanoGPT

      Public
      The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      MIT License
      6.2k000 Updated Nov 22, 2024
    • SCSS
      MIT License
      4301 Updated Nov 21, 2024
    • Python
      0000 Updated Nov 8, 2024
    • Python
      1000 Updated Nov 2, 2024
    • pyhooks

      Public archive
      A library that METR agents use to communicate with Vivaria.
      Python
      1010 Updated Sep 22, 2024
    • vivaria-mentat

      Public archive
      Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
      TypeScript
      MIT License
      22011 Updated Sep 19, 2024
    • task-template

      Public template
      TypeScript
      5912 Updated Aug 6, 2024