The Data Science Lunch Seminar Series is an informal weekly gathering of NYU Data Science affiliated persons to discuss data science related topics. Each week there is a 30 minute presentation, over lunch (provided), with additional time for conversation and questions. The series meets Wednesdays from 12:30-1:30 PM at the Center for Data Science.



January 30, 2019
Narges Razavian, NYU Medical School + NYU CILVR lab

February 6, 2019
Mark Cartwright, NYU CUSP

February 13, 2019
Qing Qu, MSDSE Fellow

February 20, 2019
Will Geary, CitySwifter

February 27, 2019
Will Farr, CCA/Stony Brook

March 6, 2019
Matthew Daniels, The Pudding

March 13, 2019
Thomas Laetsch, MSDSE Postdoc

March 20, 2019
***No talk, NYU Spring Break***

March 27, 2019
Igor Labutov, LAER.AI

April 3, 2019

April 10, 2019
Guilio Biroli, Ecole Normale Superieure in Paris

April 17, 2019
Martina Balestra, NYU Tandon School of Engineering

April 24, 2019
Krzysztof Geras, NYU School of Medicine & CDS

May 1, 2019

May 8, 2019

May 15, 2019



2018-2019 Series

September 19, 2018
Roberto Di Cosmo, IRILL, “Software Heritage: The universal software source code archive”

September 26, 2018
Karen E Adolph, NYU Databrary Project, “Video as Data and Documentation Will Transform Behavioral Science”

October 3, 2018
Matthew Rocklin, Continuum Analytics, “Scalable Analysis in Python with Dask”

October 10, 2018
***No seminar (MSDSE Summit)***

October 17, 2018
Benny Nissan, Federal Reserve Bank of New York, “Solving Business Challenges with Data Science at the Federal Reserve of New York”

October 24, 2018
William Falcon, New York University, “Life outside the box – Scaling and tuning huge neural networks across
multiple GPUs”

October 31, 2018
Stefan Karpinski, MSDSE Research Engineer, “An Intro to Julia for Data Science”

November 7, 2018
Timo Ropinski, University of Ulm, “Interactive Visualization Techniques for Molecular Structures”

November 14, 2018
Sam McKenzie, Buzsaki lab, “Extracting synaptic properties from the spiking patterns of neural ensembles”

November 21, 2018
***No seminar (Thanksgiving)***

November 28, 2018
Joseph Bullock, Global Pulse / United Nation, “Satellite image analysis for humanitarian relief”

December 5, 2018
Elizabeth Bruch, University of Michigan, “How (and Why) Online Dating Experiences Differ across American Cities”

December 12, 2018
Sebastian Schelter, Moore-Sloan Data Science Fellow, “Data Management for Machine Learning (and a little bit of Computational Social Science)”

2017-2018 Series

September 13, 2017
Tassos Noulas, Moore-Sloan Fellow
“OpenStreetCab: Insights and Challenges when Deploying a Taxi Price Comparison App in the Wild”

September 20, 2017
Djellel Difallah, Moore-Sloan Fellow
“The Dynamics of Crowdsourcing Platforms: The case of Amazon Mechanical Turk and Wikidata”

September 27, 2017
Mirco Musolesi, UCL / Alan Turing Institute
“Modelling Human Behaviour using Mobile Data for Urban Analytics”

October 4, 2017
Carlos Fontaine, Center for Genomics & Systems Biology, NYU
“Using high-throughput live microscopy and image analysis to study cell cooperation in cancer”

October 11, 2017
Enrique Cruz, Foursquare
“Foursquare Location Intelligence”

October 18, 2017
Paula Parpart, College London

October 25, 2017
Chaya D Stern, Memorial Sloan Kettering Cancer Center and Tri-Institutional Chemical Biology Program
“Is the force with us? Quantifying model uncertainty in molecular simulations”

November 1, 2017
Kyle Cranmer, NYU

November 8, 2017
***No seminar (MSDSE Summit)***

November 15, 2017
Benjamin Cook, NASA Goddard Institute for Space Studies
Monitoring and Modeling Drought and Climate Change from the Past to the Future”

November 22, 2017
***No seminar (Thanksgiving)***

November 29, 2017
Marc Gourevitch

December 6, 2017
Ward Wheeler, American Museum of Natural History

December 13, 2017
Yelena Mejova, Qatar Computing Research Institute
“Capturing Social Media Signals for Lifestyle Health”

January 24, 2018
Andreu Casas, MSDSE Postdoc in the Center for Data Science
“More Effective Than We Thought: Accounting for Legislative Hitchhikers Reveals a More Inclusive and Productive Lawmaking Process”

January 31, 2018
Harish Doraiswamy, MSDSE Research Engineer in the Center for Data Science
“Speeding up queries over large spatio-temporal data sets”

February 7, 2018
Soledad Villar, MSDSE Fellow in the Center for Data Science
“Gerrymandering and Math”

February 14, 2018
Vladimir Gligorijevic, New York University
“Deep Multi-network Embedding for Protein Function Prediction”

February 21, 2018
Richard Galvez, MSDSE Postdoc in the Center for Data Science
“The First Three Minutes: an exposition on how nuclear elements formed in the early universe and an unsolved mystery involving dark matter and Beryllium-7”

February 28, 2018
Leto Peel, University of Louvain / Namur

March 7, 2018
Esteban Moro, Universidad Carlos III de Madrid / MIT

March 14, 2018
***No seminar (Spring Break)***

March 21, 2018
Leila Mays, NASA Goddard Space Flight Center, Community Coordinated Modeling Center

March 28, 2018
Neil Bramley, MSDSE Postdoc in the Center for Data Science

April 4, 2018
AI Now Institute

April 11, 2018
Adam Summers, Friday Harbor Laboratory, University of Washington

April 18, 2018
Geoff Boeing, UC Berkeley

April 25, 2018
Iddo Drori, NYU

May 2, 2018
Katharina Kann

May 9, 2018
Julia Stoyanovich, Drexel University
“Algorithms and Systems for Responsible Data Science”

May 16, 2018
Laura Noren
“Three Ethical Challenges for Data Science”

2016-2017 Series

September 14, 2016
Juliana Freire, Executive Director of Moore-Sloan Data Science Environment

September 21, 2016
Eleni Zacharatou, “Indexing the Brain”

September 28, 2016
Michael Z. Gill, “The Economic Benefits of Conflict? Estimating Defense Firm Responses to Major Events in U.S. Foreign Policy”

October 5, 2016
Ravi Shroff

October 12, 2016 (special time 1:00-2:00pm)

October 19, 2016
Anil Kocak

October 26, 2016
***No lunch, MSDSE 2016 Data Science Summit***

November 4, 2016 (special day, Friday)
Shankar Iyer, Quora

November 9, 2016
danah boyd, Microsoft Research

November 16, 2016
Max Sklar, Foursquare

November 21, 2016 (special day, Monday)
Mike Williams, Fast Forward Labs ***CANCELED***

November 30, 2016
Manuel Garcia-Herranz, UNICEF Innovation

December 7, 2016
Jason Anastasopoulos

December 14, 2016
Rossano Schifanella, University of Torino

February 1, 2017
Justin Salamon, “Audio Source Identification in Urban/Bio-Acoustic Environments”.

February 8, 2017
Alexander Bock, “OpenSpace — Moving from What to How in Science Communication”

February 15, 2017
Nicola Barbieri, Tumblr

February 22, 2017
Yannis Liodakis, “Revealing the intrinsic timescale distribution in the most active of galaxies”

March 1, 2017
Piotr Brodka, Wroclaw University of Science and Technology (Poland), “Current Challenges in the Area of Spreading Processes in Multilayer Networks”

March 8, 2017
Jaime Earnest, DoD, “Future State: Big Data and Analytics for Better Governance”

March 15, 2017
Spring Break – No lunch seminar

March 23, 2017, 1:30-2:30
Noemi Derzsy, RPI, NASA DataNauts, “Open NASA Data: From API to Data Analysis”

March 29, 2017
Partha Mithra

April 5, 2017

April 12, 2017
Yana Volkovich, AppNexus

April 19, 2017
Hannah Wallach, Data and Society

April 26, 2017
Ciro Cattuto, Fondazione ISI

May 3, 2017
Mor Naman, “Patterns of Large-Scale Attention”

MAY 10, 2017
Enrico Bertini

2015-2016 Series

November 9th
Meredith BroussardAssistant Professor, Journalism
Topic: Data for Democracy: Uncovering Stories in Government Data

November 16th
Juan BelloAssociate Professor, Music Technology

November 23rd
No meeting

November 30th
Mohamed ZahranClinical Associate Professor, Computer Science
Topic: Hardware Advances Benefiting Data Science

December 7th
No Meeting

December 14th
Emily Miraldi, Postdoctoral Researcher, Simons Center for Data Analysis

January 27, 2016
Lightning talks by Data Science Fellows and Postdocs, NYU, Center for Data Science

February 3, 2016 (special time: 12-1pm)
Heiko Müller, NYU, CDS

February 10, 2016
Kyle Cranmer, NYU, Physics

February 17, 2016
Lars Backstrom, Facebook

February 24, 2016 (special time: 12-1.30pm)
Joint PRIISM/CDS talk: Michael Betancourt, University of Warwick

March 2, 2016
Alfred Spector, CTO of Two Sigma

March 9, 2016
Uri Shalit, NYU, Computer Science

March 16, 2016
No seminar (Spring break)

March 23, 2016
Michael Blanton, NYU, Physics

March 31, 2016 (special day: THURSDAY)
Daniel Fernández, NYU, CDS

April 6, 2016
Rumi Chunara, NYU, CS & Engineering, Global Institute of Public Health

April 13, 2016
Andy Guess, NYU, Social Media Lab

April 20, 2016
Boris Leistedt, NYU, Center of Cosmology and Particle Physics

April 27, 2016
Sunandan Chakraborty, NYU, CDS

May 4, 2016
Daphna Harel, NYU, Steinhardt

May 11, 2016
Laura Norén, NYU, CDS

2014-2015 Series

October 28th
Ken Benoit, Visiting Professor from the London School of Economics
Topic: Quantitative Text Analysis for the Social Sciences (Using R): Natural language processing and quantitative analysis of social and political texts, using an R package called quanteda ( that he is developing.

November 5th
Jonathan GoodmanProfessor of Mathematics
Topic: The hermeneutics of MCMC sampling: points of view about samplers, some of which don’t depend on your point of view.

November 13th
Greg DoblerResearch Scientist at CUSP and a Research Assistant Professor of Physics at NYU
Better Cities through Imaging: I will describe how persistent, synoptic imaging of the NYC skyline can be used to better understand the city (in analogy to how persistent, synoptic imaging of the sky can be used to better understand the heavens), giving specific examples related to energy consumption, public health, and air quality which can lead to improved city functioning and quality of life.

December 1st
Pablo BarberaPhD Candidate in Politics
Topic: How Social Media Reduces Mass Political Polarization: 
Using a new method to estimate the ideological positions of social media users and their communication networks over time, I provide evidence that exposure to dissonant political messages on social media induces political moderation. For the paper, click here.

December 9th
Mike O’NeilAssistant Professor, School of Engineering and Courant Institute
Topic: Fast Algorithms for Gaussian Processes

February 5th
Ying LuAssistant Professor of Applied Statistics at NYU and
Dr. Preeti RaghavanAssistant Professor at NYU Medical Center
Topic: Quantifying Feedforward Control: A Linear Scaling Model for Fingertip Forces and Object Weight
In this paper, we fit a linear growth curve to the biomechanical data of grasping  in order to understand the relationship between fingertip forces and object weight among healthy subjects. Our results show evidence of feedforward control during the grasping task when healthy subject grasp/lift an object with familiar weights.

February 23rd
Daniela HuppenkothenMoore Sloan Data Science Fellow at NYU
Topic: Exploring the Violent Universe: Astrophysical Data (Analysis) at High Energies
This talk will give an introduction into astronomy at very short wavelengths, where we see the universe’s most extreme phenomena, and showcase typical data analysis problems—some of which we are working on at the CDS—which we have to solve in order to understand the underlying physical processes.

March 3rd
Aaditya RanganAssociate Professor of Mathematics at NYU
Topic: Efficient Methods for Detecting Low Rank Structure
Discussion of some methods for finding low-rank submatrices within gene-expression data; It will be necessary to correct for case-control status as well as covariates.

March 11th
Tae Hong ParkAssociate Professor of Composition and Music Technology at NYU
Topic: CityGram: Sensing Urban Soundscapes
Dynamic mapping of non-ocular spatio-acoustic energies.

March 27th
Lakshminarayanan SubramanianAssociate Professor of Computer Science at NYU
Topic: Societal Network Science
Predicting Socio-economic indicators from real-time news streams

April 8th
Patrick WolfeProfessor of Statistics and Honorary Professor of Computer Science at University College London
Topic: Network Analysis and Nonparametric Statistics
Networks are ubiquitous in today’s world.  Any time we make observations about people, places, or things and the interactions between them, we have a network.  Yet a quantitative understanding of real-world networks is in its infancy, and must be based on strong theoretical and methodological foundations.  The goal of this talk is to provide some insight into these foundations from the perspective of nonparametric statistics, in particular how trade-offs between model complexity and parsimony can be balanced to yield practical algorithms with provable properties.

April 13th
Karen AdolphProfessor of Psychology and Neural Science at NYU
Topic: Databrary and Datavyu: Coding, Sharing, and Repurposing Video
A description of the promises and challenges of coding, sharing, and repurposing video data

April 23rd
Yoni HalpernGraduate Student in the Computer Science Department at NYU
Topic: Provable Methods for Discrete Factor Analysis
This talk will cover methods for structure and parameter learning in latent variable models and discuss an application for learning diagnosis models from unstructured data in electronic medical records.

April 29th
Joshua TuckerProfessor of Politics at NYU
Topic: Using Social Media to Study Politics
How can we use social media to study the political behavior of masses and elites?  What we can do, what we can sort of do, and what we’d like to be able to figure out how to do…

May 5th
Dan BrownAssociate Professor of Computer Science at the University of Waterloo
Topic: Do better lyrics make a hit?  Complexity of rhymes in music lyrics

2013-2014 Series

September 14th
Jean-Daniel Fekete, Senior Research Scientist, INRIA Saclay
Topic: Progressive Analytics: A New Programming Paradigm for Large-Scale Data Exploration

September 21st
Kyunghyun ChoAssistant Professor, Computer Science and Data Science
Topic: Lost in Interpretability
I hear from non-ML as well as ML researchers once a while that they find it difficult to work with neural networks due to the lack of interpretability compared to other simpler models which are in many cases some variants of linear models. This has always made me wonder if there is a natural tradeoff between interpretability and modelling accuracy. In this talk, however, I argue that no such tradeoff exists, which has been argued already more than a decade ago by Breiman (2001). Furthermore, I present some recent results from deep learning showing that seemingly complex models, such as deep neural nets and recurrent neural nets, are in fact interpretable, if we try hard enough.

September 28th
Arthur Spirling, Associate Professor, Politics and Data Science

October 5th
No meeting

October 12th
Kenneth J. Kurtz, Associate Professor, Psychology at SUNY Binghamton

October 19th
No meeting, see special time Thursday of that week

October 22nd (Thursday, special time)
Bingni BruntonAssistant Professor, Biology and Data Science at University of Washington

October 26th
Rachid OunitPh.D Candidate, Computer Science, UC Riverside
Topic: Classification of metagenomic sequences: how to make it faster and more accurate

November 2nd
Dan CervoneData Science Fellow, Center for Data Science