February 21st, 2015

PATH Data Jam

St Peter's University

Jersey City, NJ, USA

See the Winners!

What is a Data Jam?

It's a day-long digital workshop in which computer programmers, graphic designers, interface designers, project managers, community activists, artists and other interested members of the community collaborate intensively on solving problems focused on specific data. The outcome is a set of visualizations, analytics and apps based on this data.

Program

We will start the morning with great speakers from the Port Authority, Regional Plan Association and St Peter's University, who will introduce the data and challenges. Around noon you will join a breakout team of your choice. We will close the day with presentations of the projects created during the event.

Data

For the PATH Data Jam, all projects are required to make direct use of the data, or reference an insight derived from one of the provided datasets. We provide links to PATH turnstile data, NY Waterway ridership data, Trans Hudson ridership data (courtesy of Regional Plan Association), and links to GTFS files, but you are welcome to use other sources.


What is a Hackathon?

A hackathon is a digital workshop in which computer programmers, graphic designers, interface designers, project managers, community activists, artists and other interested members of the community collaborate intensively on solving problems.

Hackathons typically start with one or more presentations about the event and the specific subject. Participants then suggest ideas and form teams based on individual interests and skills. Then the main work of the hackathon begins, which can last anywhere from several hours to several days. At the end of the hackathon, there is usually a series of demonstrations in which each group presents the outcome of its collaboration. There is sometimes a contest element as well, in which a panel of judges selects the winning teams and prizes are given. At many hackathons, the judging panel is made up of organizers and sponsors.

What is a Data Jam?

A data jam is a type of hackathon focused on specific data sets. The outcome is a set of visualizations, analytics and apps based on this data.


Speakers

Robert Galvin

Chief Technology Officer

Port Authority of NY & NJ

@regalvin

Robert Galvin joined the Port Authority of New York and New Jersey in December 2013 as the agency’s Chief Technology Officer. Mr. Galvin has over 25 years of experience in both the public and private sectors, including leadership roles at one of the largest specialty design-build contractors in the world and as an executive at a leading supplier of ERP and field service management software for the construction industry. Prior to joining the Port Authority, Mr. Galvin was CTO of the School Construction Authority, where he reorganized the Technology department and modernized its enterprise applications by deploying data warehousing, cloud, and mobile technologies.

Courtenay Mercer

NJ Director

Regional Plan Association

@courtenaynj

Courtenay Mercer leads RPA’s research, planning, and advocacy activities in New Jersey. An expert in private- and public-sector land use, Courtenay has extensive experience in planning issues in New Jersey. Prior to RPA, she owned a planning consulting firm and also has served as the director of planning in the New Jersey Office of Smart Growth. Courtenay has been involved in several key New Jersey policy initiatives, including the State Plan, the Highlands Regional Master Plan and the state transfer of development rights program. Courtenay currently serves on the American Planning Association’s board of directors, and is past-president of APA’s New Jersey chapter. Courtenay is a New Jersey-licensed Professional Planner and certified by the American Institute of Certified Planners. She has a bachelor’s degree from Rutgers University and a master’s in city and regional planning from the Edward J. Bloustein School of Planning and Public Policy at Rutgers.

Joel Natividad

CEO & Co-founder

Ontodia

@jqnatividad

Joel Natividad is co-founder of Ontodia, a civic tech startup born out of Open Data. Even before winning NYCBigApps for the second time, Joel and his long-time friend and collaborator, Sami Baig, quit their jobs, convinced that Open Data (data about us, for us, and paid for by us) has the biggest impact. Not only will it improve and enable Gov 2.0, it will enable new businesses and business models that will maximize returns to society profitably. Joel is a civic hacker and entrepreneur who did not finish his MS in Computer Science. He is an active contributor to multiple open source projects, including CKAN. Joel is a charter member of the Open Semantic Data Association Steering Committee, and an infrequent speaker at several Semantic, Open Data, and Civic Tech gatherings. Before Ontodia, Joel headed the Knowledge Engineering Practice of a small consultancy, TCG, creating knowledge systems for Fortune 100 companies in life sciences and finance.

Emmanuel A Simon

Founding Partner

OpenJC

@uelsimon

Emmanuel is a product and innovation strategist with experience in digital marketing, brand design, strategic marketing and market analysis. He defines the strategy and manages the evolution of the product to ensure the final deliverable meets the business requirements set by the client. He is also a creative UX director with experience defining award-winning user experiences. He has helped create digital experiences for numerous top media, entertainment, education and publishing brands while at Code and Theory, a top-ten NYC interactive agency, and at Method Inc. Emmanuel has also been part of the marketing team focused on the higher education sector for Apple, Inc. He received his B.Sc. in Aerospace Engineering from Tuskegee University, and an MBA from Harvard Business School.


Judges

Philip Plotch, PhD, AICP

Assistant Professor & Public Admin Program Director

Saint Peter's University

@profplotch

Philip Mark Plotch is a political science professor and the director of the Master of Public Administration program at Saint Peter’s University. Dr. Plotch has held numerous positions in state government, including director of World Trade Center redevelopment and special projects at the Lower Manhattan Development Corporation and manager of planning and policy at the Metropolitan Transportation Authority (MTA). Dr. Plotch has taught transportation planning courses at Hunter College, and in August 2015 Rutgers University Press will publish his book, "Politics Across the Hudson: The Tappan Zee Megaproject." Dr. Plotch majored in computer science as an undergraduate, and later received his master’s degree in urban planning from Hunter College and his Ph.D. in urban and public policy from the New School. For more information, please see www.plotch.com and www.twitter.com/profplotch.

Otis Rolley

President and CEO

Newark Community Economic Development Corp

@OtisRolley

Otis Rolley III has spent his career working toward advancing community and economic development in the public, private, and non-profit sectors. Prior to joining the administration of Mayor Ras Baraka as the President and CEO of the Newark Community Economic Development Corporation (Newark CEDC), Otis was a senior manager at the national public-sector management consulting firm Urban Policy Development, based in Baltimore. He was also the founding president of a regional nonprofit, the Central Maryland Transportation Alliance, dedicated to improving and expanding transit and transportation options for the people of Central Maryland. His extensive public sector experience includes serving in various leadership positions with three different mayors. He has been a chief of staff, city planning director, and first deputy housing commissioner. Otis graduated with honors from Rutgers College with a B.A. in Political Science and Africana Studies, and from the Massachusetts Institute of Technology with a Master's in City Planning, where he concentrated in housing and community economic development.

Dr Elnardo Webster

21st CCLC Director

St Peter's University

ewebster@saintpeters.edu

Dr. Elnardo J. Webster is a native of Newark and a former student of Peshine Avenue School and Weequahic High School. After moving to Jersey City, he graduated from Lincoln High School, where he was an all-star basketball player. Dr. Webster received his BA degree from St. Peter's College, where he was the most valuable player in the All MAC Conference for two years; All Big East for two years; a Helms Foundation All American; All NIT for two years; and All NIT record holder (51 points in one game). He played professional basketball for seven years with various teams (New York Nets, Memphis Pros, New York Knicks) and in several European countries: Italy, Spain, and Switzerland. He received his doctorate degree from Seton Hall University in South Orange. Dr. Webster has been inducted into the Hall of Fame at Saint Peter's College, the Hudson County Hall of Fame, and, after playing for the Essex County Old Timers League for many years, the National Old Timers Hall of Fame. He is currently the Director of the Newark Public Schools Extended School Day Programs. During his 13 years of supervision, the After School Youth Development Program has grown from serving 1,500 students at 25 sites to serving over 10,000 students daily at over 70 sites. His after-school programs are considered to be a national model. As Director of the Office of Extended School Day Programs, Dr. Webster can boast of countless achievements, but he is most proud of the fact that some of the best academically performing and improved schools in the district frequently attribute their success to participation in his programs. He was recently selected to serve as a National Ambassador of After-school Programs. Dr. Webster experienced as much success during his eight years with the New Jersey Department of Corrections, when the program that he developed as Director of Recreational Services was recognized as "Program of the Year". He is also a former Freeholder of Hudson County.


Winners

Team Yulia

Haiku »

Good Ideas Yes,
Sufficient time not so much,
Boat to the rescue

Team Yulia
Antonio Borges, Tom Bruning, Roberto Martin, Milen Pavlov, Yulia Pavlov

#CodeAcross Breaking News Award

Investigative Journalism through data

Project details »

Correlation between rider exits at Exchange Place and NY Waterway ferry usage on weekends, also considering the impact of inclement weather on rider usage. Future enhancements include: (1) include PATH ridership historical data for several years, broken down by ride (2) add taxi data for 2014 (Chris Whong only got 2013 numbers - http://chriswhong.com/open-data/foiling-nycs-boro-taxi-trip-data) (3) add MTA data at WFC (4) add holidays and events, like Thanksgiving

Team Ukko

Haiku »

Did not submit

Team Ukko
Gary Campbell, Jaime Chaves, Andres Contreras, Daniel Fernandez, Fernando Hernandez

#CodeAcross Data Mashup Award

Best use of multiple datasets

Project details »

The team analyzed the correlation between PATH ridership data and inclement weather: how does weather impact PATH utilization? The turnstile data, recorded at 15-minute intervals, was aggregated to the daily level to match the daily information from the Weather Underground data source. The team made the following suggestions to further improve the analysis: (1) create a comprehensive events database with location and time, for example when the government closes trains (2) have the PA publish a time-series data store of historical alerts and (3) include sentiment analysis from Twitter.
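
The aggregation step described above could look roughly like the sketch below. It is not the team's code: the record layout, the sample counts, and the precipitation values are all made up for illustration, and the real turnstile and weather files may be organized differently.

```python
from collections import defaultdict
from datetime import datetime
from math import sqrt

# Hypothetical 15-minute turnstile records: (timestamp, entries).
# The real file's column layout may differ.
turnstile_rows = [
    ("2014-09-01 08:00", 120),
    ("2014-09-01 08:15", 150),
    ("2014-09-02 08:00", 90),
    ("2014-09-02 08:15", 80),
    ("2014-09-03 08:00", 60),
    ("2014-09-03 08:15", 50),
]

# Hypothetical daily precipitation in inches, keyed by date.
daily_precip = {
    "2014-09-01": 0.0,
    "2014-09-02": 0.4,
    "2014-09-03": 1.1,
}


def daily_totals(rows):
    """Sum 15-minute counts into one total per calendar day."""
    totals = defaultdict(int)
    for stamp, count in rows:
        day = datetime.strptime(stamp, "%Y-%m-%d %H:%M").date().isoformat()
        totals[day] += count
    return dict(totals)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


totals = daily_totals(turnstile_rows)
# Only compare days that appear in both data sets.
days = sorted(set(totals) & set(daily_precip))
r = pearson([totals[d] for d in days], [daily_precip[d] for d in days])
print(totals)
print(f"correlation between daily entries and precipitation: {r:.2f}")
```

A negative coefficient on real data would suggest fewer riders on wet days, but with only a few months of data it should be read as a hint rather than a finding.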

Team Trio

Haiku »

Did not submit

Team Trio
Joanna Riascos, Leonardo Velázquez, Shagufta Zareen

#CodeAcross Where's Waldo Award

Best map visualization

Project details »

Visual simulation of PATH usage based on station turnstile entrances and exits. Leveraged a prior open data visualization project as the foundation for this instance (http://chriswhong.com/open-data/visualizing-the-mtas-turnstile-data).

Team Capacity

Haiku »

Full capacity
Open Data makes Magic
The path to the PATH

Team Capacity
Christopher Castellano, Sydney Chan, Brian Evans, Kenneth Ford, Miraj Patel

#CodeAcross The Real World Award

Application solves a real problem


#CodeAcross Haiku Award

Best 5-7-5 syllable project elevator pitch

Project details »

Ridership Capacity modelling with consideration for special events (sports)

Team Path Not Taken

Haiku »

PATH is suspended
I cannot swim the Hudson
How do I get home?

Team Path Not Taken
Carla Astudillo, Jean Joseph, Victoria Nece, Nikhil Puri, Michalis Xyntarakis

#CodeAcross Inception Award

Most innovative mind-blowing project


#CodeAcross Humanitarian Award

Most impactful to the local community

Project details »

Impact on alternate transit methods of passenger redistribution when PATH service is interrupted. Questions raised: What is the closest alternative mode of transportation if the PATH train is suspended? Created a heatmap showing AM and PM maximum and average boarding times. A suggestion was made to create an app that tells residents the backup plan in case of a train outage (both location and time).

Team Quentin

Haiku »

Did not submit

Team Quentin
Quentin Stenger

#CodeAcross Dev Ninja Award

Technical excellence in project


#CodeAcross Foundation Award

Greatest contribution others can use

Project details »

Cleaned the turnstile data to make it readily available for other teams to use (file link).

Team Awesome

Haiku »

Data Science rules
Joining turnstiles to schedule
More insights to come...

Team Awesome
Charlie Bini, Yi Wang

#CodeAcross Matrix Award

Best visual representation of the data


#CodeAcross Yo Mama Award

Settles the ‘Best City’ argument once and for all


#CodeAcross Peek-a-Boo Award

Best insight from the Data

Project details »

Using Tableau, the team mashed up PATH GTFS and turnstile data to create a capacity and impact visualization for the Port Authority. Discovered that many people aren't accounted for when exiting the system, as many passengers use the handicap exit. After reaching out to the PA's Robert Galvin, the team learned that PATH train car capacity is 129 people. Suggestions on how the project can be improved: (1) Compare to real estate and resulting population growth expectations around Harrison, JSQ and Newark (2) Test true capacity of a PATH train car - is it 129 people max or is it more? (3) Add alert data/event data to the analysis (4) Get historical MetroCard/smart card data
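
As a rough illustration of the capacity arithmetic, the sketch below turns the 129-person car figure into a load factor for one 15-minute interval. Only the 129 comes from the project description; the cars per train, trains per interval, and rider count are hypothetical placeholders, and treating turnstile entries as riders on board is a simplification.

```python
CAR_CAPACITY = 129  # per-car figure the team obtained from the Port Authority


def interval_capacity(cars_per_train, trains_per_interval, car_capacity=CAR_CAPACITY):
    """Total riders that can be carried in one interval under the given assumptions."""
    return cars_per_train * trains_per_interval * car_capacity


def load_factor(riders, capacity):
    """Share of capacity in use; values above 1.0 mean demand exceeds capacity."""
    if capacity <= 0:
        raise ValueError("capacity must be positive")
    return riders / capacity


# Hypothetical service pattern: 8-car trains, 3 trains per 15-minute interval.
capacity = interval_capacity(cars_per_train=8, trains_per_interval=3)

# Hypothetical number of turnstile entries in the same interval.
riders = 2500
load = load_factor(riders, capacity)

print(f"capacity per interval: {capacity} riders")
print(f"load factor: {load:.0%}")
```

Replacing the placeholder values with schedule data from the GTFS feed and real turnstile counts would show which intervals come closest to, or exceed, the assumed capacity.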


Data

For the PATH Data Jam, all projects are required to make direct use of the data, or reference an insight derived from one of the provided datasets.

# | Source | Link | Description
1 | Port Authority | Turnstile Data; Turnstile Data (Sanitized); StopID to Turnstile Translation | 9/1/2014-2/1/2015 (5 months) turnstile data at 15-minute intervals for each station, for exits and entrances. The Turnstile Data (Sanitized) file and the StopID to Turnstile Translation file were created by Quentin Stenger during the event. Thank you Quentin!
2 | Port Authority | GTFS | The General Transit Feed Specification (GTFS) defines a common format for public transportation schedules and associated geographic information. GTFS "feeds" allow public transit agencies to publish their transit data and developers to write applications that consume that data in an interoperable way.
3 | Port Authority | Alerts | Alerts and Advisories
4 | Port Authority | Coming in 2016+ | Live train location data
5 | Regional Plan Association | HubBound | Trans Hudson Data
6 | Regional Plan Association | Census Tract Job Data | This data is for 2000 and the average of 2006-2010. It is produced by the census on the tract level and is broken down by industry. Because the Current Employment Statistics survey is considered the "Gold Standard" in job numbers, RPA adjusted the tract data to that level.
7 | Regional Plan Association | Codebook; Shape file | Discretionary Income: how much spending money do typical households in the New York metro region have left over after accounting for housing, transportation and taxes? The shape file contains the data behind this Discretionary Income visualization.
8 | Longitudinal Employer-Household Dynamics (LEHD) | Origin-Destination Employment Statistics (LODES) | Data files are state-based and organized into three types: Origin-Destination (OD), Residence Area Characteristics (RAC), and Workplace Area Characteristics (WAC), all at census block geographic detail. Data is available for most states for the years 2002–2011. RPA recommends using only 2011, as the data isn’t really comparable over time. It includes where people live, where they work on a block level, and the origin-destination pairs.
9 | NY Waterway | NY Waterway Weekend Ridership Summary; NY Waterway Weekend Ridership Data | Ridership data from February 2014-December 2014, when PATH train service between Exchange Place and WFC was suspended on weekends for upgrades of the switch signals. NY Waterway provided weekend service between Exchange Place and WFC.
10 | Other | Other transportation data sets | Other transportation data links compiled for the 2014 Hackathon
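
For participants new to GTFS, here is a minimal sketch of reading station records from a feed. GTFS feeds are sets of comma-separated text files, and stops.txt carries the stop_id, stop_name, stop_lat and stop_lon fields used here. The sample rows are made up for illustration: the stop_id values and coordinates are not copied from the Port Authority feed.

```python
import csv
import io

# Made-up excerpt in the shape of a GTFS stops.txt file; the stop_id values and
# coordinates are illustrative, not taken from the Port Authority feed.
SAMPLE_STOPS_TXT = """stop_id,stop_name,stop_lat,stop_lon
S1,Journal Square,40.733,-74.063
S2,Grove Street,40.720,-74.043
S3,Exchange Place,40.716,-74.033
"""


def load_stops(text):
    """Parse stops.txt content into a dict keyed by stop_id."""
    reader = csv.DictReader(io.StringIO(text))
    return {
        row["stop_id"]: {
            "name": row["stop_name"],
            "lat": float(row["stop_lat"]),
            "lon": float(row["stop_lon"]),
        }
        for row in reader
    }


stops = load_stops(SAMPLE_STOPS_TXT)
for stop_id, info in stops.items():
    print(stop_id, info["name"], info["lat"], info["lon"])
```

To use a real feed, pass the text of its stops.txt file to load_stops instead of the sample string. The StopID to Turnstile Translation file listed above could then be used to join these stops to the turnstile counts.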

Challenges

Ideas

We have defined several challenges for you. But you are welcome to bring your own project, start a new project or contact us with your suggestions at admin@openjerseycity.org.

1. How much strain will the PATH train services be under with the new population projections for the cities?

2. What is the impact of reduction/elimination of the overnight weekday train services?

3. What communities rely most on the PATH train services?

4. Create an interactive transit simulation based on schedule (GTFS) data for Jersey City and Newark by contributing to this project

5. Update Jersey City's transit information for the US City Open Data Census

6. Open Data Day challenge: create data scrapers to transform data into a machine-readable format that can be uploaded to an open data portal. Upload the data to the JC Community Open Data Portal.


Tools

# | Category | Link | Description
1 | Analysis | Python | Widely used general-purpose, high-level programming language
2 | Analysis | Tableau Public | Tool to explore and visualize data
3 | Analysis | OpenRefine | Tool for working with messy data: cleaning it and transforming it from one format into another
4 | Analysis | R Studio | Development environment for R, a programming language for statistical computing and graphics
5 | Visualization | Tableau Public | Free service to explore and visualize data
6 | Visualization | D3.js | JavaScript library used to create interactive graphical forms for web browsers
7 | Visualization | Fusion Tables | Fusion Tables from Google is an experimental data visualization web application to gather, visualize, and share data tables.
8 | Presentation | Prezi | Cloud-based presentation software and storytelling tool for presenting ideas
9 | Presentation | Google Slides | An online collaborative presentations application that lets you create, edit, and deliver presentations
10 | Presentation | Apple Keynote | Makes it simple to create and deliver beautiful presentations on Mac
11 | Presentation | MSFT Office | Makes it simple to create and deliver beautiful presentations on MS Windows
12 | Presentation | R Studio | Development environment for R, a programming language for statistical computing and graphics
13 | Mapping | Fusion Tables | Fusion Tables from Google is an experimental data visualization web application to gather, visualize, and share data tables.
14 | Mapping | See list from 2014 Hackathon | List of mapping tools

Program

We will provide data, speakers, wifi and food. Participants need to bring their laptops.

Time | Description
9:30-10:00am | Doors open with pastries, coffee and juice
10:00-11:30am | Speakers and presentations
11:30-12:30pm | Participants are introduced to each other and to the challenges so they can form teams
12:30-4:00pm | Break out into teams
12:30pm | Lunch arrives
4:00-6:00pm | Project submissions and presentations
6:00-7:00pm | Networking and happy hour

Partners


Sponsors