Testing and Evaluation of Intelligent Systems
A core mission of the ISC is rigorous testing and evaluation of fundamentally new AI and autonomy to address critical national challenges, integrating APL’s trusted technical advisor role with a leading, interdisciplinary research program in AI, robotics, and autonomy. Novel datasets, benchmarks, metrics, and evaluation frameworks and tools are regularly released.
Related Publications
- Johnson, E. C., E. Q. Nguyen, B. Schreurs, C. S. Ewulum, C. Ashcraft, N. M. Fendley, M. M. Baker, A. New, G. K. Vallabha, “L2Explorer: A Lifelong Reinforcement Learning Assessment Environment,” (2022).
- Fendley, N., C. Costello, E. Nguyen, G. Perrotta, C. Lowman, “Continual Reinforcement Learning with TELLA,” Conference on Lifelong Learning Agents (CoLLAs) 2022, . (2022).
- New, A., M. Baker, E. Nguyen, G. Vallabha, “Lifelong Learning Metrics,” . (2022).