From October 23-26, 2016, the New York University Moore-Sloan Data Science Environment Program welcomes MSDSE partners, the University of California-Berkeley and the University of Washington, to build community, explore ideas, and encourage collaboration between our three programs.

The 2016 Summit will take place at the Mohonk Mountain House in New Paltz, NY from Oct. 23-26.  All participants will then join the Summit with industry leaders in data science at New York University in New York City on Wednesday, Oct. 26.

Data Summit Contact Info (please don’t hesitate to reach out with any questions, etc):




This year’s Summit is focused around “building community around data science for research.”


Sunday, Oct. 23:

3:30 PM Bus departs NYU Kimmel Center
5:30 PM Bus departs Newark airport
8:30 PM Bus departs Newark airport
7:00-10:00 Check in, dinner at Mohonk
11:00/11:30 A la carte/food, box dinners available for 8:30 bus arrivals

Monday, Oct. 24:

7:30-8:30 Buffet Breakfast
9:00-9:10 Opening Remarks, Juliana Freire
9:10-9:40 WG Pitch (Education WG and Reproducibility WG leads)
Education WG Breakout: Data Science Curriculum and Content (Leader: Magdalena Balazinska)
Education WG Breakout: Implementing DS Education Programs in Existing Ecosystems (Leader: Arthur Spirling)
Education WG Breakout: Ad Hoc Data Science Education (Leader: Anthony Suen)
Reproducibility WG Breakout: Case Studies (Leader: Philip Stark)
Reproducibility WG Breakout: Badges (Leader: Allison Smith)
Reproducibility WG Breakout: Education/Evaluation (Leader: Juliana Freire, Kyle Cranmer)
Education WG Breakouts report back about topical goals for the next year
Reproducibility WG Breakouts report back about topical goals for the next year
Education WG Annual Report outline
Reproducibility WG Annual Report outline
11:35-12:00 Working groups report back to full group
12:00-1:00 Buffet Lunch
1:00-3:00 Open time (Coordinated group activities)
2:30 Coffee and snacks available
Brenden Lake, “Extracting Knowledge from Data: What can we learn from the mind and brain?” (breakout)
Ali Ferguson, “Communicating Within and Across the DSE” (breakout)
Jose Manuel Magallanes, “Conceptualizing Computational Policy Science” (breakout)
Jake VanderPlas, “XHackWeek” (roundtable)
Kyle Barbary/Juliana Freire, “Debugging Science with Workflow Tools” (breakout)
Jacob Schreiber, “Pomegranate: Fast and Flexible Probabilistic Modelling in Python” (tutorial)
Micaela Parker and Sarah Stone, “Data Science for Social Good Summer Program” (roundtable)
Laura Norén, “Data Science as Organizational Change” (roundtable)
Bijan Pesaran, “Neural Data Science — Databases, Models, and Real-Time Systems” (breakout)
Dani Ushizima, “Image Across Domains (Image XD)” (roundtable)
Rob Fatland, “Bootstrapping Student Research Developers” (roundtable)
Vasant Dhar, “Technological, Legal, and Regulatory, issues in Autonomous Transportation” (roundtable)

Please sign-up here.

5:05:5:15 Break
Tyler McCormick, “How Do I Know If I’m Doing Computational Social Science?” (birds-of-a-feather)
Vicky Steeves, “Libraries Working Group Meet-Up” (birds-of-a-feather)
Jasmine Nirody, “Not Just -Omics” (birds-of-a-feather)
Bernease Herman, “Entrepreneurship and Academic Data Science” (birds-of-a-feather)
Brett Naul, “Problems in Time Series Inference” (birds-of-a-feather)
David Beck, “Increasing Diversity Across Data Science Fields” (birds-of-a-feather)
6:15-7:30 Dinner
7:30-? Happy hour reception + lightning talks:
***7:30-7:35 LT 1: Kellie Ottoboni, “Simple Random Sampling: Not So Simple”
***7:35-7:40 LT 2: Rumi Chunara, “Living on the Edge: Interdisciplinary Data Science Work”
***7:40-7:45 LT 3: Joseph Hellerstein, “Teach data design before coding”
***7:45-7:50 LT 4: Dani Ushizima, “Dementia and Multimodal Brain Imaging: early detection on the way?”
***7:50-7:55 LT 5: Brenden Lake, “Learning from small data: How human learning can inform data science”
***7:55-8:00 LT 6: Jacob Schreiber, “Parallelized Out-of-Core Mixture Modelling in pomegranate”
***8:00-8:05 LT 7: Jamie Whitacre, “Jupyter Lightning Talk”
***8:05-8:10 LT 8: David Hogg, “The Cannon: A data-driven model of stars”
***8:10-8:15 LT 9: Magdalena Balazinska, “Database support for image analytics at scale”
***8:15-8:20 LT 10: Ariel Rokem, “Building systems for analyzing big neuroscience imaging data”
***8:20-8:25 LT 11: Erika Lage, “Turning Insurance Claims Data into Insights for Multiple Myeloma”

Tuesday, Oct. 25:

7:30-8:30 Buffet Breakfast
9:00-9:10 Opening Remarks, Juliana Freire
9:10-9:40 WG Pitch (Software and Careers WG leads)
Software WG Breakout: Writing JOSS papers and discussing alternate publication venues for software (Leader: Jake VanderPlas)
Software WG Breakout: Research Software Engineers as career paths (Leader: Claudio Silva)
Software WG Breakout: Prototyping tools for extracting software metrics from Github (Leader: Fernando Perez)
Careers WG Breakout: Dual mentoring and joint appointments for tenure track faculty: (How) Is this working? (Leader: Ed Lazowska)
Careers WG Breakout: Non-tenure-track appointments and “professors of practice” (Leader: Henry Brady)
Careers WG Breakout: Post Moore-Sloan sustainability of fellow, postdoc and other research positions (Leader: Mik Laver)
Software WG Breakouts report back about topical goals for the next year
Careers WG Breakouts report back about topical goals for the next year
Software WG Annual Report outline
Careers WG Annual Report outline
11:35-12:00 Working groups report back to full group
12:00-1:00 Buffet Lunch
1:00-3:00 Open time (Coordinated group activities)
2:30 Coffee and snacks available
Karthik Ram, “Hackroulette” (tutorial);

Submit ideas here

Nick Adams, “Human-Centered Data Science” (breakout)
David Hogg, “Making Better Figures” (tutorial)
Abt, “Data Scientists—tell us what works for you, and what you need next!” (breakout)
Stuart Geiger, “Data Science Career Paths and Prospects Survey” (breakout) – Careers WG presents survey results
Garret Christensen, “Reproducibility Case Studies from the Social Sciences” (breakout)
Spaces: Overview, Objectives, and Programs
Data Science Studies: Diversity and Inclusivity (combine with Laura Norén and Daniela Huppenkothen’s “Inclusivity Workshop”)
Fernando Perez, “Project Jupyter” (breakout)
Jake VanderPlas and Stefan Karpinski, “Data Science Programming Languages” (tutorial)
Neil Davies, “Data Science for Effective Local Response to Global Change” (breakout)
5:05:5:15 Break
5:15-6:00 Lightning talks:
***5:15-5:20 LT 1: Alexandra Paxton, “The Center for Data on the Mind: Helping cognitive scientists leverage big data to explore cognition and behavior”
***5:20-5:25 LT 2: Valentina Staneva, “Toward Observable Data Observatories”
***5:25-5:30 LT 3: Michael Gill, “Understanding Firm Responses to Major Events in U.S. Foreign Policy”
***5:30-5:35 LT 4: Garret Christensen, “Geographic Variation in Food Prices and SNAP Benefits”
***5:35-5:40 LT 5: Tyler McCormick, “Statistics for social and economic networks”
***5:40-5:45 LT 6: Sunandan Chakraborty, “Enforcement Gap Interface: A Computational Model to Detect Illegal Wildlife Sales Online”
***5:45-5:50 LT 7: Kevin Koy, “ImageXD: Image Processing Across Domains”
***5:50-5:55 LT 8: Chris Holdgraf, “Using MNE-python to study electrical signals in the brain”
***5:55-6:00 LT 9: Kevin Munger, “Tweetment Effects on the Tweeted”
6:15-7:30 Dinner
6:15-8:15 Core team meeting
7:30-? Happy hour reception + posters/demos

Wednesday, Oct. 26:

This event at NYU on Wednesday will bring together industry leaders, funders, government agencies, and data science practitioners and academics from a broad range of institutions and fields, from life, physical, and social sciences to the computational, statistical, and mathematical disciplines. We expect more than 200 people to come together to explore what it means to advance data science for research.

From 7:30 to 10:30 pm, we will unwind at 230 Fifth Rooftop Bar (230 Fifth avenue, corner of 27th street). Cocktails, refreshments and Hors d’Oeuvres will be served.

08:45 Depart Mohonk
11:30-12:30 Check in to hotel, leave bags with hotel; Filter in to Rosenthal; Buffet lunch at Rosenthal
1:00-1:10 Opening remarks – Dr. Katherine Fleming
1:10-1:35 Overview of MSDSE – Chris Mentzel and Josh Greenberg
1:35-1:50 Highlights from the 3 DSEs
1:50-2:35 Careers panel
2:35-3:45 Lightning talks + Posters/Demos
***2:55-3:00 LT 1: Jake VanderPlas (CalPoly/Sloan/Moore): Altair: Declarative Statistical Visualization in Python
***3:15-3:20 LT 2: Holly Bik: Phinch (UC Riverside/Sloan): An interactive, exploratory data visualization framework for –Omic datasets
***3:35-3:40 LT 3: Nick Adams (UC Berkeley): AI Requires Social Scientists and Crowds
3:45-4:05 Keynote: Tracy Teal, “Democratizing data skills to advance data-driven-discovery”
4:05-5:30 Lightning talks + Posters/Demos + wine/cheese
***4:15-4:20 LT 4: Mario Juric (UW): The Large Synoptic Survey Telescope: Ushering the Age of Data-driven Astronomy
***4:35-4:40 LT 5: Kerstin Kleese van Dam (Brookhaven): Streaming Data Analysis for Highly Correlated Events
***4:55-5:00 LT 6: Alyssa Goodman (Harvard): Linked-View Visualization of High-Dimensional Data in glue
***5:15-5:20 LT 7: Fernando Chirigati (NYU): Data Polygamy: The Many-Many Relationships among Urban Spatio-Temporal Data Sets
5:30-5:50 Keynote: Chris Ré, “DeepDive and Snorkel: Dark Data Systems to Answer Macroscopic Questions”
5:50-6:00 Closing remarks – DSE Leadership
6:00-7:00 Transit to bar
7:00-9:00 230 Fifth rooftop event

Traveling to Mohonk Mountain House

1000 Mountain Rest Road, New Paltz, NY 12561

Please Note: A bus will be leaving Kimmel Center (60 Washington Square S, New York, NY 10012) for Mohonk on Sunday 10/23 at 3:30 p.m.

Stay tuned for RSVP details!

By car from NYC:

  1. Go north on the NYS Thruway (I-87 north) to Exit 18, New Paltz.
  2. After leaving the NYS Thruway at Exit 18, turn left at the traffic light onto Route 299/Main Street.
  3. Follow Main Street through New Paltz. Immediately after crossing the bridge over the WallKill River, turn right (follow “Mohonk” sign).
  4. After 1/4 mile bear left at the fork and follow Mountain Rest Road and Mohonk signs to the Gatehouse. (Note: The Mohonk Gatehouse is located about six miles west of the Thruway Exit).



  • Train service is available via Metro North and Amtrak trains to the Poughkeepsie Train Station. MTA Schedules and Fares
  • The train station is 18 miles away from Mohonk via US-299 and US 9N” For directions, please use this link: Train station to Mohonk


  • Port Authority in New York City to New Paltz Bus Station.
  • The bus station is 7 miles away from Mohonk. For directions, please use think: Bus station to Mohonk

Local Car Service (Newburgh, NY)

  • All Transportation Network: (800) 258-9879 or book online at

Mohonk Contact Information:

Front Desk: (845) 256-2016

Reservations: (855) 833-3798

Please Note: As part of your reservation, Mohonk includes several complimentary recreational activities. Click here to check out all of the activities that Mohonk has to offer!


Github Code of Conduct


Check back soon for details about events, projects, hack sessions, and more regarding what will surely be another inspiring event!