Carnegie Mellon's Department of Statistics and

alternative text


Tartan Data Science Cup:
Episode III -- Capital Rogue One

Congratulations, Winners!

1st Place: "paranormal" (Sally McNichols, Andy Liu)
2nd Place: "Anything Random?" (Kweonwoo Jung, Maria Rodriguez De La Cruz, Nikita Gupta)
Best Data Visualization: "Bae's theorem" (Lina Sheremet, Joey Gibli, L├ęShaun Jones)
Best Technical Writing: "Mitophace " (Christopher Morris, Grace Yu, Michael You)

Episode III Competition Information

The problem for Episode III can be found here.

The data for Episode III can be found here.

The submission link for Episode III can be found here.

We are excited to partner with Capital One for this episode of the TDSC. Student participants will analyze data relating to Capital One's business focuses and present their results to a team of judges from the company. Capital One's team of employees will be on campus and at the competition to engage with participants, answer any questions, and judge the finalists.

Episode III -- Capital Rogue One

Information Session: Tuesday, January 31st, 2017, 4:30-5:30pm, Porter Hall 125C

The information session will allow students to ask any questions they have about the competition. This is also an opportunity for students to form teams and register for the competition. Students do not have to attend the information session to participate.

Pizza and drinks will be provided!
Specific information about the data or research topics, however, will not.

Registration Deadline: Wednesday, February 1st, 2017, 11:59pm

All currently enrolled Carnegie Mellon University undergraduate students on the Pittsburgh campus are eligible to participate. All teams must include two or three students; students can only participate on one team. All student names and Andrew IDs must be included when registering. Registration must also include a (non-identifying) team name.

To register, click here.

Release the Data! Thursday, February 2nd, 2017

The dataset will be available on Thursday evening but without details about the specific competition questions. Participants should try to do some exploratory data analysis prior to the competition in order to focus their efforts on Saturday.

Reception with Capital One: Friday, February 3rd, 2017, 7-9pm, CUC Connan Room

Please join us at a reception sponsored by Capital One to network and talk about statistics, data analytics, or the TDSC!

Capital One employees will also be at the reception to answer questions about the dataset and employment opportunities. Stop by!

Competition Begins! Saturday, February 4th, 2017, 9am, Scaife Hall 125

The research problem and competition question(s) will be released on this website at 9am. Students are welcome to work anywhere, but Scaife Hall 125 will be open all day as the TDSC Homebase. TDSC organizers will also be available during the day to answer questions.

A light breakfast will be provided in the TDSC Homebase for participants at 8:30am.
Lunch will be provided for participants at 12pm.

Time's Up! Saturday, February 4th, 2017, 5pm

At 5pm, submissions are due. More details about submissions will be available the day of the competition.

Submission constitutes permission to post winning team entries online (under non-identifying team name).

Presentations and Final Results: Saturday, February 4th, 2017, 7pm, Giant Eagle Auditorium (Baker Hall A51)

There will be a panel of judges from Capital One and the Department of Statistics. The judges will review the code, reports, and slides from 5-7pm and then watch the slide presentations at 7pm. Students are encouraged to practice their presentations over the 5-7pm dinner break.

The top 8 teams will be given five minutes to present their methods and results to the judges, the other teams, and anyone else who wishes to attend. Teams can have up to three slides, but be careful -- you will be cut off after exactly five minutes! Teams outside of the top 8 are still eligible to win other prizes and encouraged to stay and watch the final presentations.

The judging criteria include:

  • Code: Is the submitted code reproducible, well-documented, correct, and easy to understand?
  • Report: Does the submitted report describe the problem, methods, and results in a clear and concise manner?
  • Presentation: Did the team communicate their problem, methods, and results to an uninformed audience effectively?
  • Results: Do the team's results sufficiently and effectively answer the research problems presented?

Prizes (sponsored by Capital One)

alternative text

1st Place Team: Apple TV and $50 Amazon Gift Card for each team member
Best Data Visualization: $50 Amazon Gift Card for each team member
Best Technical Writing: $50 Amazon Gift Card for each team member

Additionally, the 1st place team will receive the Tartan Data Science Cup. After each competition, the Cup is presented to the winning team, who are allowed to keep the cup and gloat for a short period of time. Members of the winning team will have their names engraved onto the Cup.

TDSC Organizer Contact Information:

Sam Ventura (, Rebecca Nugent (