EvalAI: Evaluating AI systems at scale

Deshraj

Title:

EvalAI: Evaluating AI systems at scale

Files

DESHRAJ-THESIS-2018.pdf (13.97 MB)

Author(s)

Deshraj

Advisor(s)

Batra, Dhruv

Advisor(s)

Person

Parikh, Devi

Person

Batra, Dhruv

Associated Organization(s)

Organizational Unit

College of Computing

Organizational Unit

School of Computer Science

Collections

Theses and Dissertations

Permanent Link

http://hdl.handle.net/1853/60738

Abstract

Artificial Intelligence research has progressed tremendously in the last few years. There has been the introduction of several new multi-modal datasets and tasks due to which it is becoming much harder to compare new algorithms with existing ones. To solve this problem, this thesis introduces EvalAI, an open source platform for evaluating and comparing machine learning and artificial intelligence algorithms at scale. This platform is built to provide an open source, standardized, scalable solution for evaluating learned models using automatic metrics as well as with human-in-the-loop evaluation. By simplifying and standardizing the process of benchmarking, EvalAI seeks to lower the barrier to entry for participating in the global scientific effort to push the frontiers of machine learning and artificial intelligence, increasing the rate of measurable progress in these communities.

Date Issued

2018-12-06

Resource Type

Text

Resource Subtype

Thesis

Full item page

Title:

EvalAI: Evaluating AI systems at scale

Files

Author(s)

Authors

Advisor(s)

Advisor(s)

Editor(s)

Associated Organization(s)

Series

Collections

Supplementary to

Permanent Link

Abstract

Sponsor

Date Issued

Extent

Resource Type

Resource Subtype

Rights Statement

Rights URI

Georgia Tech Library

Title: EvalAI: Evaluating AI systems at scale

Files

Author(s)

Authors

Advisor(s)

Advisor(s)

Editor(s)

Associated Organization(s)

Series

Collections

Supplementary to

Permanent Link

Abstract

Sponsor

Date Issued

Extent

Resource Type

Resource Subtype

Rights Statement

Rights URI

Title:

EvalAI: Evaluating AI systems at scale