Frequently Asked
Questions
DATA2GO.NYC
General
What is DATA2GO.NYC?
DATA2GO.NYC
is a free, easy-to-use online mapping and data tool created by Measure of
America, an initiative of the nonprofit Social Science Research Council, with
the support of the Leona M. and Harry B. Helmsley Charitable Trust.
DATA2GO.NYC
brings together for the first time federal, state, and city data on a broad
range of issues critical to the well-being of all New Yorkers. The website
includes over 300 indicators for New York City’s fifty-nine community
districts; 150 of these indicators are also available by census tract. Many of
these indicators were previously unavailable to the public.
Who funded this project?
DATA2GO.NYC
was funded by The Leona M. and Harry B. Helmsley Charitable Trust as the first
project in their New
York City Regional Grants Program, launched in early 2014. This new program seeks to identify and solve specific problems to help
the region’s residents meet basic needs for themselves and their families,
maintain economic security, and access public goods and services that improve
economic well-being, security, and quality of life.
New York
City and many organizations already make a great deal of data public. What
value does DATA2GO.NYC add?
Lots
of great online data tools exist. But many are the product of one-time-only
efforts, focus on just one issue or sector, or are tricky for regular people to
navigate. In addition, different indicators are available using different
geographic boundaries, time periods, definitions, interfaces, and
methodologies. This variety makes comparisons very challenging. DATA2GO.NYC is:
• Intuitive and easy-to-use. It’s designed
to provide access to official government data for “data civilians”—people who
need accurate, up-to-date information but lack advanced statistical and
research training. All data are presented by NYC community district, and some
150 indicators are available by both community district and census tract.
• Comprehensive and multi-sectoral.
DATA2GO.NYC includes data from
twelve different sectors, including health, education, environment, political
engagement, and public safety, covering a wide range of issues that affect the
well-being of everyday New Yorkers and shape the choices and opportunities open
to them.
• Informed by a holistic vision. A unique
blend of datasets, interactive maps, and analytical tools links data on human
need, resources, and outcomes and helps users see social and economic realities
in the interconnected way in which people actually experience them.
• Sustainable. DATA2GO.NYC provides a snapshot of current
realities alongside a commitment to update data regularly for at least five
years.
• Transparent and rigorous. DATA2GO.NYC is an impartial, independent source
for methodologically sound, reliable, fully sourced data.
• Customizable. DATA2GO.NYC allows users to tailor the set of indicators
they see on their screens based on the issues that are important to them.
• Useful to experts. DATA2GO.NYC
allows the pros to access underlying data and primary source material in
sortable spreadsheet form with detailed sourcing and methodological notes.
Who is DATA2GO.NYC for?
Anyone can use DATA2GO.NYC! It is a free website available to everyone with a passion for
understanding and improving the amazing city we call home. It was, however,
designed with the needs of these users in mind:
• Decision-makers in and outside NYC government, who can access a
comprehensive picture of human assets and needs, neighborhood by neighborhood.
• Foundations,
philanthropists, and service delivery organizations, which can pinpoint need and service gaps, establish
baselines, and track change to gauge the impact of their investments and
programs.
• Communities
and community-based organizations,
which can create and print customized maps, data visualizations, and
comparisons for program development, fundraising, and advocacy without
expending resources on costly and time-consuming in-house data collection and
analysis.
• Teachers
and students, who can easily access data for analyzing their communities, how their
communities compare to others, and how issues interact to shape people’s
everyday lives and long-term prospects.
• Researchers, who can download clean, well-sourced, accurate
datasets for hundreds of indicators.
What are some practical uses for DATA2GO.NYC?
DATA2GO.NYC
has virtually unlimited possibilities! Here are a few examples of how people
might use the tool:
• Health service providers could investigate the CONNECTIONS between a specific health disease or ailment such as
asthma or diabetes and various neighborhood conditions that might contribute to
it to ensure that their programming addresses root causes and to inform public
information campaigns.
• Community organizers could go to DASHBOARDS,
select relevant indicators of well-being, and present them at a community
meeting to support their case for shifting priorities or creating new programs.
• Staff at nonprofit organizations working on food security could use MAPS to see where the greatest needs
for their services are. They could use DASHBOARDS
to see what other challenges high-food-insecurity neighborhoods face. With this
information, they could identify potential partners for joint outreach and
programming.
• NYC government agencies could use the data to inform policy-making and
ensure that services reach the most vulnerable; elected officials could use the
resource to better understand the needs of their constituents.
• Community-based organizations could download infographics and maps
related to their service area to include with grant applications or public
information materials.
• Teachers could use the data to better understand out-of-classroom
challenges their students face.
• Service-delivery organizations could use CORRELATIONS to identify possible programming priorities. For instance,
a strong relationship exists between the share of the population receiving SNAP
benefits and the share of the population eating too few fruits and vegetables;
this finding might suggest that assistance to people receiving SNAP should aim
to boost fruit and vegetable consumption by improving access, lowering cost, or
providing nutrition education.
Measure of America released several stories
concurrently with DATA2GO.NYC to show
the kinds of analysis the tool allows. One looks at neighborhood factors
associated with high levels of obesity, another analyzes the prevalence of
foreclosure by community district and race, and a third explores links between
income and health behaviors. These stories illustrate the power of DATA2GO.NYC to provide new insights for better,
more effective policy. Read the stories here.
How were the indicators chosen from all the datasets available?
DATA2GO.NYC focuses on human well-being. Given
the myriad factors that affect people from the day they are born until the day
they die—from living conditions and safety to schools and jobs to health and
family structure—we curated the dataset behind DATA2GO.NYC to be as expansive as possible within a well-being framework.
If an indicator matters for people’s ability to live freely chosen, rewarding
lives and we could access that indicator, we added it. If you see something
missing, tell us: contact@measureofamerica.org. Please keep in
mind that we are able to include only those data sets that will be collected
regularly and meet a high standard of accuracy, rigor, and transparency.
Can I add my own indicators to DATA2GO.NYC?
We love crowdsourcing, too! But DATA2GO.NYC is intended as a reliable resource
containing only the most accurate, up-to-date information. We would, however,
love to hear your ideas for new indicators to include. If the data are collected by a public
entity, university, or other provider of high-quality data and are either
already available by community district or can be geocoded by community
district, we would be happy to consider adding them to DATA2GO.NYC. Please email us at contact@measureofamerica.org with your suggestion.
How should students and researchers cite DATA2GO.NYC in projects and
papers?
Thanks for asking. The suggested attribution is:
Measure of
America, Social Science Research Council. 2015. DATA2GO.NYC. www.data2go.nyc.
Are there plans to adapt DATA2GO.NYC for other cities?
We would love to
take DATA2GO to other cities. We encourage foundations, business groups,
government entities, and others interested in a DATA2GO for their town to get
in touch with us at contact@measureofamerica.org.
Who created DATA2GO.NYC?
DATA2GO.NYC was created by Measure of America, a project of the
Social Science Research Council, Codie See, and Rosten
Woo.
Measure
of America provides easy-to-use yet methodologically sound tools for
understanding well-being and opportunity in America. Through reports,
custom-built dashboards, interactive online maps and tools, and evidence-based
analysis, Measure of America works with partners to breathe life into numbers, using
data to identify areas of need, pinpoint levers for change, and track progress
over time. Measure of America’s hallmark is the American Human
Development Index, a composite measure of health, education, and earnings
indicators based on a time-tested model developed at the United Nations and
used in countries worldwide. Cofounders Sarah Burd-Sharps
and Kristen Lewis founded Measure of America in 2007, and Measure of America
joined the
Social Science Research Council (SSRC) in 2008. For more information, please see our website at www.measureofamerica.org.
Rosten Woo is an artist, designer, and writer living in Los
Angeles. He produces artworks and educational projects about complicated
systems and group decision-making. Visit rostenwoo.biz.
An Advisory Committee of
twenty-five leading NYC policy-makers and thinkers has been involved from the
start to make sure Data2Go builds on, rather than duplicates, existing tools
and that it benefits from expertise in every sector. These advisors came from
numerous city agencies and offices of elected officials; from the city’s
leading universities; from Philanthropy New York, the Robin Hood Foundation,
and other philanthropic organizations; and from City Harvest and other leading
civil society and community-based organizations.
What is the American Human Development Index?
One of the
indicators on DATA2GO.NYC is the American Human
Development Index. The American Human Development Index is a measure
of well-being and opportunity that is made up of health, education, and
earnings indicators and expressed on a scale of 0 to 10; 10 is the top score. A
supplement to economic metrics like Gross Domestic Product, this index is a
modification of the United Nations’ Human Development Index, which has become
the global gold standard for assessing human progress, well-being, and freedom
in the twenty years since it was first introduced. The index rests on a strong
conceptual framework: Nobel Laureate Amartya Sen’s capabilities approach.
Maps
Should I view MAPS in full screen mode?
Yes. The MAPS view works best in full screen
mode, or at least in a window that covers most of the screen. In a smaller
window, some of the functionality is lost.
What geographic area is covered by the MAPS section in DATA2GO.NYC?
DATA2GO.NYC covers all areas within the city
limits of New York City, comprising the five boroughs of the Bronx, Brooklyn,
Manhattan, Queens, and Staten Island.
What kinds of data are available in the MAPS section of DATA2GO.NYC?
Two kinds of
data are available in the MAPS section
of DATA2GO.NYC: geographic-area data
and facility-location data.
Geographic-area data are available for the fifty-nine community districts of NYC and, for a
smaller subset of DATA2GO indicators, for most of the 2,168 NYC census tracts
designated by the US Census Bureau. To get area data, click on “select an
indicator.” For example, if you click the indicator “median personal earnings,”
you will see represented on the map the median earnings (in dollars) of all
workers ages 16 and up for each census tract in NYC. If you click “smoking,”
you will see the percentage of adults in each community district who currently
smoke. The MAPS section shows the indicator at the smallest geographic unit for
which the data are available, community district for most and census tract for
a smaller group.
Facility-location data tell you information about a specific facility—for example, a school, a
library, a park, or a waste facility. Of the seventeen features available,
nearly all are operated, funded, licensed, or certified by a government agency.
So the schools, colleges, and universities do not include private or for-profit
schools (though they do include charter schools), and the supermarkets and
waste facilities are all government licensed or certified. These data were all
obtained from the licensing source, and that source is accessible by clicking
the i in the data box. Of these thousands of
records, we expect that, in some cases, records may be incorrect or outdated.
We welcome corrections. Email us at contact@measureofamerica.org.
To get these
data, click on “Features of Interest” to get a dropdown menu. When you click on
a Feature of Interest category, a dot for each facility of that type will
appear on the map. To learn more about a specific facility, click the
individual dot, and its name and information about it will appear in a box on
the lower right part of the screen. For example, if you want to know more about
CUNY City College, choose the “colleges and universities” category, then click
the dot that represents its location in upper Manhattan; detailed information
on the school, including tuition, graduation rates, and demographic data, will
appear in the box. The source of these data can be obtained by clicking on the i.
Why choose community districts as the main unit of analysis?
The community-district
level was chosen for DATA2GO following consultations with the project’s
Advisory Committee and interviews with potential end-users. The consensus was
that this is the best unit of analysis available today, given the many
constraints on data collection, for the intended uses of DATA2GO. While there
is, of course, tremendous variation in well-being outcomes within each of these
fifty-nine community districts, data analysis by community district opens up
access to a whole set of data collected through the US Census Bureau’s annual
American Community Survey, because the Census-designated Public Use Microdata
Areas (PUMAs) for NYC map almost exactly onto the city’s community districts.
See more on PUMAs below for details.
Census tracts are an excellent geographic unit of
analysis for understanding variation, but because they average only about 4,000
people, the numbers can get too small for reliable estimates. This is
especially true when you are looking at a subset of these 4,000 residents, for
example, measuring something related only to children under five years of age.
As a result, data for over half of DATA2GO.NYC’s
indicators are not available by census tract. Where we were able to obtain
tract-level data, nearly all of the tract-level indicators in DATA2GO are
five-year estimates, meaning that sixty months of data have been combined
together. Though the data are less timely, this approach is a common best
practice with small sample sizes in order to increase greatly the precision and
accuracy of the data.
Dashboards
What are
DASHBOARDS?
The DASHBOARDS
section of DATA2GO.NYC lets
users select a unique set of indicators for a specific community district as
well as compare multiple districts on those indicators. The DASHBOARDS section features data
visualization modules that make the data behind DATA2GO.NYC clear, informative, and ready for presentation. Examples of the
type of information within these modules include racial and ethnic composition,
age and gender distribution, and levels of educational attainment, to name a
few. Users can share their customized DASHBOARD
via social media or print a hard copy to present at a conference.
Can I
save my DASHBOARD?
Each customized Dashboard view has been designed to
have its own unique URL. This enables you to return to this view whenever you
want, save it as a printable file, or share it via social media.
Should I view DASHBOARDS in full screen mode?
Yes. The DASHBOARDS view works best in full
screen mode, or at least in a window that covers most of the screen. In a
smaller window, some of the functionality is lost.
Sometimes the indicator percentages don’t add up to 100%; why is that?
We rounded most indicators; as a result, the
percentages for a handful of indicators don’t sum to 100%.
Connections
What are CONNECTIONS?
The CONNECTIONS
section of DATA2GO.NYC enables users
to do two things: explore the correlation between two variables across NYC’s
fifty-nine community districts and examine how individual community districts
stack up when compared with other city districts on the given variables. The CONNECTIONS page asks and answers the
following question: “Is there a correlation between variable x and variable y,
and if so, how strong is the relationship?”
What is a correlation?
A correlation describes the relationship between two variables. Correlations can be very useful in understanding whether, and to what
extent, one variable relates to another. Identifying a correlation is often a
first step leading to further research.
The
CONNECTIONS section of DATA2GO.NYC
uses Pearson’s correlation coefficient, which measures the linear relationship
between two variables. Pearson’s correlation coefficient yields a number
ranging from –1 to +1 and is represented with the symbol r. If r is positive, the
linear relationship between the two variables is positive, meaning that as one
variable increases across the fifty-nine NYC districts, the other also tends to
increase. If r is negative, then as
one variable increases, the other tends to decrease. The further r is from zero (in either a positive or
negative direction), the stronger the relationship between the two variables.
For example, the correlation between the indicators
“Median Household Income (2013 $)” and “Completed at Least a Bachelor’s Degree
(% of adults 25+)” has an r value of
0.88, meaning that, across NYC’s fifty-nine community districts, these two
variables have a strong, positive correlation. In plain English we can say that
as median household income increases, the share of adults with at least a
bachelor’s degree tends to increase. There is a strong relationship between
these two variables.
If I find
a correlation between two variables and want to investigate this further, where
can I go?
We highly encourage our users to dive deeper
analytically than the information available in the CONNECTIONS section. For those wishing to further explore specific
indicators, sourcing information is available in great detail in the
downloadable dataset, including source URLs.
What
indicators are available in the CONNECTIONS section of DATA2GO.NYC? What if I
want to see connections between indicators not listed in CONNECTIONS?
CONNECTIONS includes a carefully selected set of x
variables, each of which can be paired with a unique set of y variables. By
clicking on the dots on the graph, you can obtain the (x,y) coordinates for every NYC community district. You
can also compare two sets of coordinates by clicking on one dot and then
passing your cursor over a second dot.
This section includes a subset of DATA2GO.NYC indicators. These indicators were
selected based on the social science literature on these topics that shows that
these relationships are important to explore for well-being. Researchers interested
in exploring the full range of correlations can access the complete DATA2GO.NYC datasets by clicking the “download”
tab.
Are there any issues in comparing data in the CONNECTIONS section?
While all the data were the most recent available
in the summer of 2015, the original data sources conduct their studies with
differing frequency. This should not, however, undermine the value of the
analysis for pinpointing need, identifying areas for further research,
developing fact-based solutions, and more.
What conclusions can be drawn from the CONNECTIONS section?
The Pearson’s correlation describes the linear
relationship between two variables. But it does not prove that a change in one
variable causes a change in the other
variable. For example, ice cream sales and murder rates tend to rise in tandem
in places with hot summers. Does that mean that ice cream consumption makes
people murderous, or that murder makes people crave ice cream? Of course not.
Research shows that they both rise in the summer months because warm weather
makes ice cream a particularly appealing treat and summer is a time when people
are more likely to get together and to be outside, where they come into greater
contact with one another. Determining causal relationships requires extensive
research and subject-matter expertise.
Should I view CONNECTIONS in full screen mode?
Yes. The CONNECTIONS view works best in full
screen mode, or at least in a window that covers most of the screen. In a smaller
window, some of the functionality is lost.
Data and Sourcing
Can I download the data on DATA2GO.NYC?
Yes! We have made nearly the entire dataset behind
DATA2GO.NYC available to the public.
Please find the Download tab and select either .xls
or .csv format. The only data that are not available for download are the data
obtained by special agreement with the Foundation Center. To see more data about philanthropy, explore Foundation Maps, a service
of the Foundation Center here.
How were the data sets available in DATA2GO.NYC produced?
Measure of America staff undertook nearly a year of
work, involving negotiations, calculations, geocoding, statistical exercises,
and number-checking, to arrive at a user-friendly dataset. DATA2GO.NYC was built with the needs and skills
of a data layperson in mind. While experts can put raw data into forms that
they can analyze, most people don’t have the skills or software for this task.
In DATA2GO.NYC, much of the raw data
were converted into rates to make the indicator more meaningful for analysis
and more reliable for comparisons. It’s not very useful to compare the number
of robberies in one community district to those in another when the total
population size in each is very different. For this, a rate per 1,000 residents
does the trick. All voting and New York Police Department data were geocoded
from addresses provided by special agreement in order to present them by
community district. Some of the data were manipulated to present them in a more
accessible format. The NYC Department of Sanitation, for example, provides data
on tons of waste per person per month. This number is infinitesimally small but
if converted into annual waste data, is much easier to understand and use. Some
indicators are calculated by Measure of America, such as the disconnected youth
rate and the American Human Development Index score. For questions regarding
specific indicators, please ask us at contact@measureofamerica.org.
Where do
the data come from?
Most of the indicators available in DATA2GO.NYC are from federal, state, or NYC
public sources. A few were obtained from nonprofit sources such as the Food Bank
for New York City and the NYU Furman Center. The source of each and every
indicator in DATA2GO.NYC can be
found by clicking INFO in MAPS or the “i” near the indicator name in DASHBOARDS and in the
Features of Interest. These institutions use rigorous sampling techniques,
often sampling over several years to ensure a more accurate estimate.
What is a Census Tract?
Census
Tracts are small, relatively permanent
subdivisions of a county (or NYC borough) that are updated prior to each
decennial census, which is conducted by the US Census Bureau. Tracts have a
population size between 1,200 and 8,000 people, with an optimum size of 4,000
people. Tracts usually cover a contiguous area and generally follow
identifiable features.
Can you compare data from different time
periods—for example, data that samples just 2013 to data that samples
2011–2013?
Ideally, all
data in DATA2GO.NYC would sample the
exact same time period. However, it is not realistic to expect the many different agencies and organizations across the wide variety
of sectors included in DATA2GO.NYC to
conduct uniform survey data collection. DATA2GO.NYC was created in part to provide these data in one geographical
unit to enable such comparisons. All data included are the most recent
available as of the summer of 2015. While there may be some minor
methodological issues comparing slightly different periods of sampling, we
believe that these potential issues are far outweighed by the power of uniform
geographies within DATA2GO.NYC.
Geographies
What geographic areas are available in DATA2GO.NYC?
Where possible, we’ve pulled area data for four
distinct geographic levels:
• NYC
• The five boroughs
• Fifty-nine community districts OR fifty-five Public Use
Microdata Areas
• 2,168 Census Tracts
City- and borough-level data are available from the
wide variety of sources found in DATA2GO.NYC.
The community district-level data primarily come from city agencies. PUMA- and
census-tract-level data primarily come from the US Census Bureau.
What are community districts?
Community districts correspond to community
planning boards. The fifty-nine NYC community districts roughly line up with
neighborhoods or groups of neighborhoods, and NYC community boards have limited
authority over planning and some types of local expenditure within them. The
names of neighborhoods within community districts are not officially
designated, and the names used in DATA2GO.NYC
do not include all known neighborhood names within the various districts. City Council
districts are separate geographical units. They are not present in DATA2GO.NYC. Please see http://www.NYC.gov/html/dcp/html/neigh_info/nhmap.shtml for more information.
Twelve areas fall outside community district
boundaries, comprising the two NYC airports and ten large parks/recreation
areas, including Central Park and Flushing Meadows. For these areas, called
“Joint Interest Areas” by the Department of City Planning of NYC, no data are
available. For details, please see http://www.NYC.gov/html/dcp/html/lucds/cdstart.shtml.
What are Public Use Microdata Areas (PUMAs)? Are they the same as
community districts?
Public Use Microdata Areas (PUMAs) are
Census Bureau–defined geographies that generally correspond to the fifty-nine
community districts of NYC. PUMAs roughly standardize the population of each
community district to enable more accurate comparisons between these areas.
However, eight community districts’ populations are too small, so the Census
Bureau has combined them into four separate PUMAs. These combined community
districts comprise the following PUMAs: Manhattan 1 & 2, Manhattan 4 &
5, Bronx 1 & 2, and Bronx 3 & 6. To see how these geographies interact,
please see http://www.NYC.gov/html/dcp/pdf/census/puma_cd_map.pdf.
For PUMAs that comprise two community
districts, when percentage or rate data were only available at the PUMA level,
it is assumed that these percentages or rates hold constant for both community
districts. For instance, in Bronx District 1 and District 2, the percentage of
the population that is Latino is 70.4 percent. It is thus assumed that Bronx
Community District 1 is 70.4 percent Latino and that Bronx CD 2 is 70.4 percent
Latino. This is important to note when comparing community district data to
PUMA data.
Within DATA2GO.NYC there are several indicators that describe change over time.
The ideal goal was to look at data from 2000 as the first year for the change.
However, for a variety of technical reasons, mostly stemming from the fact that
the US Census Bureau did not sample PUMA-level data in a way that was
representative of all states until 2005, PUMA data is sourced from three-year
samples from 2005 to 2007. Reliable PUMA-level data were not available for
earlier years.
Can I make comparisons between Public Use Microdata Area and
community-district-level data?
Yes! It is important to note the
difference between the four PUMAs that comprise two community districts each,
as discussed above, but otherwise we hope that new and interesting connections are
found between these similar geographical units.
How are Census Tracts used in DATA2GO.NYC? Are they
reliable?
Census Tract-level data are available for indicators that come
from the US Census Bureau (Decennial Census 2000 or 2010 and the annual American
Community Survey). There are 2,168 Census Bureau–designated census tracts in
New York City. For accurate analysis, tracts with total populations under 100
people (or no people at all) are suppressed entirely from the dataset. They are
notated with gray on the maps. Suppressed tracts include bodies of water,
industrial areas, cemeteries, or parkland.