Access the full text.
Sign up today, get DeepDyve free for 14 days.
A. Albert, Jasleen Kaur, Marta González (2017)
Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large ScaleProceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
E. Ziegel (2003)
The Elements of Statistical LearningTechnometrics, 45
(2018)
Health Effects of Overweight and Obesity in 195 Countries over 25 Years
B. Saelens, J. Sallis, Jennifer Black, Diana Chen (2003)
Neighborhood-based differences in physical activity: an environment scale evaluation.American journal of public health, 93 9
R. Ewing, R. Brownson, D. Berrigan (2006)
Relationship between urban sprawl and weight of United States youth.American journal of preventive medicine, 31 6
K. Morland, A. Roux, S. Wing (2006)
Supermarkets, other food stores, and obesity: the atherosclerosis risk in communities study.American journal of preventive medicine, 30 4
E. Nsoesie, P. Butler, Naren Ramakrishnan, S. Mekaru, J. Brownstein (2015)
Monitoring Disease Trends using Hospital Traffic Data from High Resolution Satellite Imagery: A Feasibility StudyScientific Reports, 5
C. Durand, M. Andalib, G. Dunton, J. Wolch, M. Pentz (2011)
A systematic review of built environment factors related to physical activity and obesity risk: implications for smart growth urban planningObesity Reviews, 12
K. Booth, Megan Pinkston, W. Poston (2005)
Obesity and the built environment.Journal of the American Dietetic Association, 105 5 Suppl 1
Yi Yang, S. Newsam (2010)
Bag-of-visual-words and spatial extensions for land-use classification
S. Casagrande, M. Whitt-Glover, K. Lancaster, A. Odoms-Young, T. Gary (2009)
Built environment and health behaviors among African Americans: a systematic review.American journal of preventive medicine, 36 2
R. Merrill, J. Richardson (2009)
Validity of Self-Reported Height, Weight, and Body Mass Index: Findings from the National Health and Nutrition Examination Survey, 2001-2006Preventing Chronic Disease, 6
Sanae Inagami, D. Cohen, B. Finch, S. Asch (2006)
You are where you shop: grocery store locations, weight, and neighborhoods.American journal of preventive medicine, 31 1
D. Marmanis, M. Datcu, T. Esch, Uwe Stilla (2016)
Deep Learning Earth Observation Classification Using ImageNet Pretrained NetworksIEEE Geoscience and Remote Sensing Letters, 13
Li (2009)
Built environment and 1-year change in weight and waist circumference in middle-aged and older adults: Portland Neighborhood Environment and Health Study.Am J Epidemiol, 169
A. Razavian, Hossein Azizpour, Josephine Sullivan, S. Carlsson (2014)
CNN Features Off-the-Shelf: An Astounding Baseline for Recognition2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops
A. Esteva, Brett Kuprel, R. Novoa, J. Ko, S. Swetter, H. Blau, S. Thrun (2017)
Dermatologist-level classification of skin cancer with deep neural networksNature, 542
R. Ewing, T. Schmid, R. Killingsworth, Amy Zlot, S. Raudenbush (2003)
Relationship between Urban Sprawl and Physical Activity, Obesity, and MorbidityAmerican Journal of Health Promotion, 18
W. Zheng, D. McLerran, Betsy Rolland, Xianglan Zhang, M. Inoue, K. Matsuo, Jiang He, P. Gupta, K. Ramadas, S. Tsugane, F. Irie, A. Tamakoshi, Yu-Tang Gao, Renwei Wang, X. Shu, I. Tsuji, S. Kuriyama, Hideo Tanaka, H. Satoh, Chien-Jen Chen, Jian-Min Yuan, K. Yoo, H. Ahsan, W. Pan, D. Gu, M. Pednekar, C. Sauvaget, S. Sasazuki, T. Sairenchi, Gong Yang, Y. Xiang, M. Nagai, Takeshi Suzuki, Y. Nishino, S. You, W. Koh, Sue-Kyung Park, Yu Chen, Chen-Yang Shen, M. Thornquist, Ziding Feng, D. Kang, P. Boffetta, J. Potter (2011)
Association between body-mass index and risk of death in more than 1 million Asians.The New England journal of medicine, 364 8
G. McCormack, B. Giles-Corti, Andrea Lange, Terry Smith, Karen Martin, T. Pikora (2004)
An update of recent evidence of the relationship between objective and self-report measures of the physical environment and physical activity behaviours.Journal of science and medicine in sport, 7 1 Suppl
Ken Chatfield, K. Simonyan, A. Vedaldi, Andrew Zisserman (2014)
Return of the Devil in the Details: Delving Deep into Convolutional NetsArXiv, abs/1405.3531
T. Jafar, Muhammad Islam, N. Poulter, J. Hatcher, C. Schmid, A. Levey, N. Chaturvedi (2005)
Children in South Asia Have Higher Body Mass–Adjusted Blood Pressure Levels Than White Children in the United States: A Comparative StudyCirculation, 111
Fuzhong Li, P. Harmer, B. Cardinal, M. Bosworth, Deb Johnson-Shelton, Jane Moore, A. Acock, Naruepon Vongjaturapat
American Journal of Epidemiology Original Contribution Built Environment and 1-year Change in Weight and Waist Circumference in Middle-aged and Older Adults Portland Neighborhood Environment and Health Study
Craig Hales, M. Carroll, C. Fryar, Cynthia Ogden (2017)
Prevalence of Obesity Among Adults and Youth: United States, 2015-2016.NCHS data brief, 288
(2017)
The State of Obesity
Fuzhong Li, P. Harmer, B. Cardinal, M. Bosworth, A. Acock, Deb Johnson-Shelton, Jane Moore (2008)
Built environment, adiposity, and physical activity in adults aged 50-75.American journal of preventive medicine, 35 1
L. Dwyer-Lindgren, Greg Freedman, Rebecca Engell, T. Fleming, Stephen Lim, C. Murray, A. Mokdad (2013)
Prevalence of physical activity and obesity in US counties, 2001–2011: a road map for actionPopulation Health Metrics, 11
A. Rundle, K. Neckerman, L. Freeman, G. Lovasi, Marnie Purciel, J. Quinn, Catherine Richards, N. Sircar, Christopher Weiss (2008)
Neighborhood Food Environment and Walkability Predict Obesity in New York CityEnvironmental Health Perspectives, 117
L. Maaten, Geoffrey Hinton (2008)
Visualizing Data using t-SNEJournal of Machine Learning Research, 9
(2016)
Overweight & Obesity: Adult Obesity Facts
G. Lovasi, M. Hutson, Mónica Guerra, K. Neckerman (2009)
Built environments and obesity in disadvantaged populations.Epidemiologic reviews, 31
Alexia Kelly-Schwartz, J. Stockard, S. Doyle, Marc Schlossberg (2004)
Is Sprawl Unhealthy?Journal of Planning Education and Research, 24
K. Moore (2020)
Healthy PlacesEnquiry The ARCC Journal for Architectural Research
L. Mobley, E. Root, E. Finkelstein, Olga Khavjou, R. Farris, J. Will (2006)
Environment, obesity, and cardiovascular disease risk in low-income women.American journal of preventive medicine, 30 4
Russ Lopez (2004)
Urban sprawl and risk for being overweight or obese.American journal of public health, 94 9
A. Kaczynski, K. Henderson (2008)
Parks and recreation settings and active living: a review of associations with physical activity function and intensity.Journal of physical activity & health, 5 4
Javier López-Zetina, Howard Lee, R. Friis (2006)
The link between obesity and the built environment. Evidence from an ecological analysis of obesity and vehicle miles of travel in California.Health & place, 12 4
T. Rex (2010)
A practical guide to the American Community Survey (5-year estimates)
Tim Althoff, R. Sosič, J. Hicks, A. King, S. Delp, J. Leskovec (2017)
Large-scale physical activity data reveal worldwide activity inequalityNature, 547
P. Gordon-Larsen, Melissa Nelson, P. Page, B. Popkin (2006)
Inequality in the Built Environment Underlies Key Health Disparities in Physical Activity and ObesityPediatrics, 117
Obesity rates & trends-adult obesity in the United States
R. Sturm, Ashlesha Datar (2005)
Body mass index in elementary school children, metropolitan area food prices and food outlet density.Public health, 119 12
N. Humpel, N. Owen, E. Leslie (2002)
Environmental factors associated with adults' participation in physical activity: a review.American journal of preventive medicine, 22 3
Melissa Nelson, P. Gordon-Larsen, Yan Song, B. Popkin (2006)
Built and social environments associations with adolescent overweight and activity.American journal of preventive medicine, 31 2
Q. Nguyen, Mehdi Sajjadi, M. McCullough, Minh Pham, Thu Nguyen, Weijun Yu, H. Meng, M. Wen, Feifei Li, Ken Smith, Kimberly Brunisholz, T. Tasdizen (2018)
Neighbourhood looking glass: 360º automated characterisation of the built environment for neighbourhood effects researchJournal of Epidemiology and Community Health, 72
J. Black, J. Macinko (2008)
Neighborhoods and obesity.Nutrition reviews, 66 1
S. Mohanty, David Hughes, M. Salathé (2016)
Using Deep Learning for Image-Based Plant Disease DetectionFrontiers in Plant Science, 7
B. Giles-Corti, S. Macintyre, J. Clarkson, T. Pikora, R. Donovan (2003)
Environmental and Lifestyle Factors Associated with Overweight and Obesity in Perth, AustraliaAmerican Journal of Health Promotion, 18
R. Jeffery, J. Baxter, M. McGuire, J. Linde, Judy, Maureen Jennifer
International Journal of Behavioral Nutrition and Physical Activity Are Fast Food Restaurants an Environmental Risk Factor for Obesity?
A. Karnieli, U. Gilad, M. Ponzet, T. Svoray, R. Mirzadinov, O. Fedorina (2008)
Assessing land-cover change and degradation in the Central Asian deserts using satellite image processing and geostatistical methodsJournal of Arid Environments, 72
S. Doyle, Alexia Kelly-Schwartz, Marc Schlossberg, J. Stockard (2006)
Active Community Environments and Health: The Relationship of Walkable and Safe Communities to Individual HealthJournal of the American Planning Association, 72
J. Maddock (2004)
The Relationship between Obesity and the Prevalence of Fast Food Restaurants: State-Level AnalysisAmerican Journal of Health Promotion, 19
Sujoy Ghosh, C. Bouchard (2017)
Convergence between biological, behavioural and genetic determinants of obesityNature Reviews Genetics, 18
L. Frank, Martin Andresen, T. Schmid (2004)
Obesity relationships with community design, physical activity, and time spent in cars.American journal of preventive medicine, 27 2
A. Ellaway, S. Macintyre, X. Bonnefoy (2005)
Graffiti, greenery, and obesity in adults: secondary analysis of European cross sectional surveyBMJ : British Medical Journal, 331
M. Kuczmarski, R. Kuczmarski, M. Najjar (2001)
Effects of age on validity of self-reported height, weight, and body mass index: findings from the Third National Health and Nutrition Examination Survey, 1988-1994.Journal of the American Dietetic Association, 101 1
Gong Cheng, Junwei Han, Xiaoqiang Lu (2017)
Remote Sensing Image Scene Classification: Benchmark and State of the ArtProceedings of the IEEE, 105
(2016)
500 Cities: Local Data for Better Health
A. Afshin, M. Forouzanfar, M. Reitsma, P. Sur, Kara Estep, Alex Lee, L. Marczak, A. Mokdad, M. Moradi-Lakeh, M. Moradi-Lakeh, M. Naghavi, Joseph Salama, T. Vos, K. Abate, C. Abbafati, M. Ahmed, Ziad Al-Aly, A. Alkerwi, Rajaa Al-Raddadi, Azmeraw Amare, A. Amberbir, A. Amegah, E. Amini, S. Amrock, R. Anjana, Johan Ärnlöv, Johan Ärnlöv, H. Asayesh, Amitava Banerjee, A. Barać, Estifanos Baye, Derrick Bennett, A. Beyene, S. Biadgilign, S. Biryukov, E. Bjertness, D. Boneya, Ismael Campos-Nonato, J. Carrero, P. Cecílio, Kelly Cercy, Liliana Ciobanu, Leslie Cornaby, S. Damtew, L. Dandona, L. Dandona, R. Dandona, R. Dandona, S. Dharmaratne, Bruce Duncan, B. Eshrati, A. Esteghamati, V. Feigin, Valery Feigin, João Catalão, Thomas Fürst, T. Gebrehiwot, Audra Gold, P. Gona, P. Gona, P. Gona, Atsushi Goto, T. Habtewold, K. Hadush, N. Hafezi-Nejad, S. Hay, M. Horino, F. Islami, R. Kamal, A. Kasaeian, S. Katikireddi, A. Kengne, C. Kesavachandran, Yousef Khader, Y. Khang, J. Khubchandani, J. Khubchandani, Daniel Kim, Y. Kim, Y. Kinfu, S. Kosen, Tiffany Ku, B. Defo, Anil Kumar, Heidi Larson, M. Leinsalu, Xiaofeng Liang, Stephen Lim, Patrick Liu, A. Lopez, Rafael Lozano, Azeem Majeed, Azeem Majeed, R. Malekzadeh, D. Malta, M. Mazidi, C. McAlinden, S. Mcgarvey, D. Mengistu, George Mensah, Gert Mensink, H. Mezgebe, E. Mirrakhimov, U. Mueller, J. Noubiap, C. Obermeyer, F. Ogbo, M. Owolabi, George Patton, F. Pourmalek, Mostafa Qorbani, Anwar Rafay, R. Rai, C. Ranabhat, Nikolas Reinig, S. Safiri, Joshua Salomon, J. Sanabria, I. Santos, B. Sartorius, M. Sawhney, J. Schmidhuber, J. Schmidhuber, A. Schutte, M.I. Schmidt, S. Sepanlou, Moretza Shamsizadeh, S. Sheikhbahaei, Min-Jeong Shin, Min-Jeong Shin, R. Shiri, I. Shiue, H. Roba, H. Roba, D. Silva, Jonathan Silverberg, Jasvinder Singh, Jasvinder Singh, S. Stranges, Soumya Swaminathan, R. Tabarés-Seisdedos, Fentaw Tadese, B. Tedla, B. Tegegne, A. Terkawi, J. Thakur, M. Tonelli, R. Topor-Madry, S. Tyrovolas, K. Ukwaja, O. Uthman, Masoud Vaezghasemi, T. Vasankari, V. Vlassov, S. Vollset, E. Weiderpass, A. Werdecker, Joshua Wesana, R. Westerman, Yuichiro Yano, N. Yonemoto, G. Yonga, Z. Zaidi, Z. Zenebe, B. Zipkin, C. Murray (2017)
Health Effects of Overweight and Obesity in 195 Countries over 25 Years.The New England Journal of Medicine, 377
R. Team (2014)
R: A language and environment for statistical computing.MSOR connections, 1
M. Papas, A. Alberg, R. Ewing, K. Helzlsouer, T. Gary, A. Klassen (2007)
The built environment and obesity.Epidemiologic reviews, 29
Jia Deng, Wei Dong, R. Socher, Li-Jia Li, K. Li, Li Fei-Fei (2009)
ImageNet: A large-scale hierarchical image database2009 IEEE Conference on Computer Vision and Pattern Recognition
Olga Russakovsky, Jia Deng, Hao Su, J. Krause, S. Satheesh, Sean Ma, Zhiheng Huang, A. Karpathy, A. Khosla, Michael Bernstein, A. Berg, Li Fei-Fei (2014)
ImageNet Large Scale Visual Recognition ChallengeInternational Journal of Computer Vision, 115
A. Rundle, A. Roux, L. Freeman, Douglas Miller, K. Neckerman, Christopher Weiss (2007)
The Urban Built Environment and Obesity in New York City: A Multilevel AnalysisAmerican Journal of Health Promotion, 21
Ferreira (2007)
Environmental correlates of physical activity in youth—a review and update.Obes Rev, 8
H. Burdette, R. Whitaker (2004)
Neighborhood playgrounds, fast food restaurants, and crime: relationships to overweight in low-income preschool children.Preventive medicine, 38 1
Isabel Ferreira, K. Horst, W. Wendel-Vos, S. Kremers, F. Lenthe, J. Brug (2007)
Environmental correlates of physical activity in youth – a review and updateObesity Reviews, 8
Neal Jean, M. Burke, Sang Xie, W. Davis, D. Lobell, Stefano Ermon (2016)
Combining satellite imagery and machine learning to predict povertyScience, 353
J. Reed, B. Ainsworth (2007)
Perceptions of Environmental Supports on the Physical Activity Behaviors of University Men and Women: A Preliminary InvestigationJournal of American College Health, 56
Jeffery (2006)
Are fast food restaurants an environmental risk factor for obesity?Int J Behav Nutr Phys Act, 3
C. Rutt, K. Coleman (2005)
Examining the relationships among built environment, physical activity, and body mass index in El Paso, TX.Preventive medicine, 40 6
Susan Babey, T. Hastert, Hongjian Yu, E. Brown (2008)
Physical activity among adolescents. When do parks matter?American journal of preventive medicine, 34 4
Key Points Question How can convolutional neural IMPORTANCE More than one-third of the adult population in the United States is obese. Obesity networks assist in the study of the has been linked to factors such as genetics, diet, physical activity, and the environment. However, association between the built evidence indicating associations between the built environment and obesity has varied across environment and obesity prevalence? studies and geographical contexts. Findings In this cross-sectional modeling study of 4 US urban areas, OBJECTIVE To propose an approach for consistent measurement of the features of the built extraction of built environment (ie, both environment (ie, both natural and modified elements of the physical environment) and its natural and modified elements of the association with obesity prevalence to allow for comparison across studies. physical environment) information from images using convolutional neural DESIGN The cross-sectional study was conducted from February 14 through October 31, 2017. A networks and use of that information to convolutional neural network, a deep learning approach, was applied to approximately 150 000 assess associations between the built high-resolution satellite images from Google Static Maps API (application programing interface) to environment and obesity prevalence extract features of the built environment in Los Angeles, California; Memphis, Tennessee; San showed that physical characteristics of a Antonio, Texas; and Seattle (representing Seattle, Tacoma, and Bellevue), Washington. Data on adult neighborhood (eg, the presence of obesity prevalence were obtained from the Centers for Disease Control and Prevention’s 500 Cities parks, highways, green streets, project. Regression models were used to quantify the association between the features and obesity crosswalks, diverse housing types) can prevalence across census tracts. be associated with variations in obesity prevalence across different MAIN OUTCOMES AND MEASURES Model-estimated obesity prevalence (obesity defined as body neighborhoods. mass index30, calculated as weight in kilograms divided by height in meters squared) based on built environment information. Meaning The convolutional neural network approach allows for consistent RESULTS The study included 1695 census tracts in 6 cities. The age-adjusted obesity prevalence was quantification of the features of the built 18.8% (95% CI, 18.6%-18.9%) for Bellevue, 22.4% (95% CI, 22.3%-22.5%) for Seattle, 30.8% (95% environment across neighborhoods and CI, 30.6%-31.0%) for Tacoma, 26.7% (95% CI, 26.7%-26.8%) for Los Angeles, 36.3% (95% CI, comparability across studies and 36.2%-36.5%) for Memphis, and 32.9% (95% CI, 32.8%-32.9%) for San Antonio. Features of the geographic regions. built environment explained 64.8% (root mean square error [RMSE], 4.3) of the variation in obesity prevalence across all census tracts. Individually, the variation explained was 55.8% (RMSE, 3.2) for Invited Commentary Seattle (213 census tracts), 56.1% (RMSE, 4.2) for Los Angeles (993 census tracts), 73.3% (RMSE, 4.5) for Memphis (178 census tracts), and 61.5% (RMSE, 3.5) for San Antonio (311 Supplemental content census tracts). Author affiliations and article information are listed at the end of this article. CONCLUSIONS AND RELEVANCE This study illustrates that convolutional neural networks can be used to automate the extraction of features of the built environment from satellite images for studying health indicators. Understanding the association between specific features of the built environment and obesity prevalence can lead to structural changes that could encourage physical activity and decreases in obesity prevalence. JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 Open Access. This is an open access article distributed under the terms of the CC-BY License. JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 1/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence Introduction The Global Burden of Disease study estimates that more than 603 million adults worldwide were 1 2-4 obese in 2015. In the United States, more than one-third of the adult population is obese, and 46 states have an estimated adult obesity rate of 25% or more. Obesity is a complex health issue that has been linked to a myriad of factors, including genetics, demographics, and behavior. Behavioral traits that encourage unhealthy food choices and a sedentary lifestyle have been associated with features in the social and built environment (ie, both natural and modified elements of the physical environment). The built environment can influence health through the availability of resources, such as housing, activity and recreational spaces, and measures of community design. 8-26 Studies have shown that certain features of the built environment can be associated with obesity and physical activity across different life stages. Research also supports the association between obesity and environmental factors, including walkability, land use, sprawl, area of residence, access to resources (eg, recreational facilities and food outlets), level of deprivation, and perceived 27-30 safety. Proximity and access to natural spaces and sidewalks can lead to increased and regular 31-35 physical activity, especially in urban areas. Despite these associations between obesity and the built environment, inconsistencies have been noted across studies and geographic contexts on the association between specific features of 28,36-38 the built environment and obesity prevalence. These inconsistencies could be due to variations in measures and measurement tools across studies, making it difficult to assess and compare findings. Furthermore, the process of measuring these features can be costly, time- consuming, and subject to human judgment and bias. Approaches that enable consistent measurement and allow for comparison across studies are needed. Assessing and quantifying the association of the built environment with obesity would be useful for selecting and implementing 4,29 appropriate community-based interventions and prevention efforts. Herein, we propose a method for comprehensively assessing the association between adult obesity prevalence and the built environment that involves extracting neighborhood physical features from high-resolution satellite imagery using a previously trained (hereafter termed pretrained) convolutional neural network (CNN), a deep learning approach. Nguyen et al used CNNs to classify images of the built environment from Google’s Street View to assess the association between obesity and the presence of crosswalks, building types, and street greenness or landscaping. However, their study did not take full advantage of CNNs for independently discovering features that can be associated with obesity prevalence and was limited to 3 predetermined features. In contrast, we comprehensively assess features in the built environment and demonstrate our approach by providing fine-grained associations with obesity prevalence at the census tract level for 4 US regions. Our approach is also scalable and relies on openly available data and computational tools and can enable comparability across studies. Methods Obesity Prevalence Data We used 2014 estimates of annual crude obesity prevalence at the census tract level from the 500 2,40,41 Cities project. These estimates were computed from the Behavioral Risk Factor Surveillance System data, wherein the survey respondents are 18 years or older and the body mass index (BMI; calculated as weight in kilograms divided by height in meters squared) threshold for obesity is 30. We selected cities from states with high (Tennessee and Texas) and low (Washington and California) prevalence of obesity. The 6 cities selected included Los Angeles, California; Memphis, Tennessee; San Antonio, Texas; and Seattle, Tacoma, and Bellevue, Washington. Because Seattle, Tacoma, and Bellevue are neighboring cities with few census tracts, we combined their data into a single data set, hereinafter referred to as Seattle. The study was exempt from institutional review board approval because the research involved the study of existing data and records collected by external parties in JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 2/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence such a manner that individuals cannot be identified. This study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline where applicable. The analysis consisted of 2 steps. First, we processed satellite images to extract features of the built environment using the CNN and extracted and processed point-of-interest (POI) data. Second, we used elastic net regression to build a parsimonious model to assess the association between the built environment and obesity prevalence. Acquiring Satellite Imagery and POI Data We downloaded images from Google Static Maps API (application programing interface) by providing the geographic center, image dimensions, and zoom level for each image. The zoom level and image dimensions were set to 18 and 400 × 400 pixels, respectively, for the entire data set. For each city, we divided its geographic span into a square grid, where each point is a pair of latitude and longitude values and the grid spacing is approximately 150 m. Further, we used census tract shapefiles to associate each image with its respective census tract and excluded images that were from areas outside the city limits. We used the same square grid to select geographic locations and performed a radial nearby search within an appropriate distance to download POI data through the Google Places of Interest API. Points of interest that were located outside city limits were excluded. We collected a set of 96 unique POI categories, and for each census tract we counted the number of locations associated with each category (eTable 1 in the Supplement). All POI data and satellite images were initially downloaded from February 14 through 28, 2017, and updated during the study period, which lasted through October 31, 2017. Satellite images downloaded through the API are only marked with the date of download represented by a time stamp and watermark at the bottom. Image Processing Convolutional neural networks have achieved groundbreaking success with large data sets in critical computer vision tasks (eg, object recognition, image segmentation) as well as health-related 42 43 applications (eg, recognition of skin cancer) and estimating poverty. Owing to the lack of a large labeled data set for classifying high- and low-obesity regions, we adopted a transfer learning approach (eFigures 1 and 2 in the Supplement), which involves using a pretrained network to extract features of the built environment from our unlabeled data set of nearly 150 000 satellite images. Transfer learning involves fine-tuning the pretrained CNN for a new task (with modification to the output layer) or using the pretrained CNN as a fixed feature extractor combined with linear classifiers or regression models. These approaches have been successfully implemented to perform computer vision tasks that are markedly different from object recognition. We used the VGG-CNN-F network, which is composed of 8 layers (5 convolutional and 3 fully connected) and is trained on approximately 1.2 million images from the ImageNet database (a data set of >14 million images used for large-scale visual recognition challenges) for recognizing objects belonging to 1000 categories. The network learns to extract gradients, edges, and patterns that 48,49 aid in object detection. Studies using similar transfer learning approaches have shown that features extracted from networks trained on the ImageNet data are effective at classifying aerial imagery into fine-grained semantic classes of land use (eg, golf courses, bridges, parking lots, buildings, and roads). We collected outputs from the second fully connected layer of the network for each image in 45,50 our data set. The second fully connected layer has 4096 nodes, each of which has nonlinear connections with all other nodes in the previous and next layers. Each feature vector has 4096 dimensions, corresponding to the output (also termed activations) from these nodes. These outputs were further aggregated into mean feature vectors for each census tract by computing the mean from all images belonging to a census tract. We do not link these features to specific elements in the built environment. Rather, these features collectively represent an indicator of the built environment. To investigate whether the CNN can differentiate between built environment features, we made a forward pass through the network for a randomly selected set of images and examined JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 3/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence the output maps from convolutional layers of the CNN (Figure 1). We also group image features to illustrate that built environment characteristics in areas with low and high obesity prevalence are distinct (eFigure 3 in the Supplement). Statistical Analysis We applied Elastic Net, a regularized regression method that eliminates insignificant covariates, preserves correlated variables, and is well suited to the high-dimensional (n = 4096) feature vectors extracted from our image data set. Regularization in Elastic Net guards against overfitting, which is a concern given the high dimensionality of our feature data set. To select an appropriate value for the tuning parameter (λ value), we used cross-validation and selected the value that minimized the mean cross-validated error. We performed 5-fold cross-validated regression analyses to quantify the following associations: (1) between features of built environment and prevalence of obesity at the census tract level, (2) Figure 1. Visualization of Features Identified by the Convolutional Neural Network (CNN) Model Building Green cover Roads Water A Filter outputs corresponding to water and buildings Filter outputs corresponding to green cover and buildings C Filter outputs corresponding to roads and green cover The images on the left column are satellite images taken from Google Static Maps API (application programing interface). Images in the middle and right columns are activation maps taken from the second convolutional layer of VGG-CNN-F network after forward pass of the respective satellite images through the network. The CNN understands image by interpreting the output from filters learned during the training phase. The activation maps may not always align exactly with the original image owing to padding of output within the CNN. From Google Static Maps API, DigitalGlobe, US Geological Survey (accessed July 2017). JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 4/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence between density of POIs and prevalence of obesity at the census tract level, and (3) between features of built environment and per capita income obtained from the American Community Survey 2014 five-year estimates at the census tract level (eTable 2 in the Supplement). We also split the data into 2 random samples, using 1 sample representing 60% of the data in model fitting and the remaining 40% in validation across all analysis. These analyses were performed jointly for all regions and independently for each region. Additional details are available in eMethods in the Supplement. Results We included 1695 census tracts from the 6 cities. Based on 2010 census data, individuals 18 years and older constituted 78.8% (n = 96 410) of the population of Bellevue, 84.6% (n = 515 147) of the population of Seattle, 77.0% (n = 152 760) of the population of Tacoma, 76.9% (n = 2 918 096) of the population of Los Angeles, 74.0% (n = 478 921) of the population of Memphis, and 73.2% (n = 971 407) of the population of San Antonio. The per capita income varied across these cities, with cities such as Bellevue and Seattle having much higher values. Specifically, the corresponding values in US dollars obtained from the American Community Survey 2014 five-year estimates were $50 405 for Bellevue, $44 167 for Seattle, $26 805 for Tacoma, $28 320 for Los Angeles, $21 909 for Memphis, and $22 784 for San Antonio. The cities with the highest per capita income also had the lowest city-level age-adjusted obesity prevalence estimates, at 18.8% (95% CI, 18.6%-18.9%), and 22.4% (95% CI, 22.3%-22.5%), respectively, for Bellevue and Seattle. In contrast, the age-adjusted obesity prevalence was 30.8% (95% CI, 30.6%-31.0%) for Tacoma, 26.7% (95% CI, 26.7%-26.8%) for Los Angeles, 36.3% (95% CI, 36.2%-36.5%) for Memphis, and 32.9% (95% CI, 32.8%-32.9%) for San Antonio. Visualization of the outputs from the convolutional layers of the VGG-CNN-F suggests that our model learns to identify features of the environment that have been associated with obesity from the satellite images. Specifically, the CNN captured gradients and edges corresponding to the presence of roads, buildings, trees, water, and land (Figure 1). After regularization, we retained 125 features for all cities combined, 157 for Los Angeles, 79 for Memphis, 69 for San Antonio, and 85 for Seattle. These features of the built environment explained 64.8% (root mean square error [RMSE], 4.3) of the variation in obesity prevalence in out-of-sample estimates across all 1695 census tracts based on the elastic net regression. Individually, our models explained 55.8% (RMSE, 3.2) of the variation in obesity prevalence for Seattle (213 census tracts), 56.1% (RMSE, 4.2) for Los Angeles (993 census tracts), 73.3% (RMSE, 4.5) for Memphis (178 census tracts), and 61.5% (RMSE, 3.5) for San Antonio (311 census tracts) in out-of-sample estimates (Figure 2 and Figure 3 and eFigures 4-11 in the Supplement). Our approach consistently presents a strong association between obesity prevalence and the built environment indicator across all 4 regions, despite varying city and neighborhood values. These high associations between features of the built environment and obesity were achieved without labeling the satellite images or fine-tuning the CNN model to differentiate between images from neighborhoods with high vs low prevalence of obesity. Compared with the features of the built environment, the POI data explained 42.4% (RMSE, 4.3) of the variation in obesity prevalence across all 1695 census tracts in out-of-sample estimates. The variation explained at the regional level was approximately 14.0% (RMSE, 4.5) for the 213 Seattle census tracts, 29.2% (RMSE, 5.4) for the 993 Los Angeles census tracts, 43.0% (RMSE, 4.1) for the 311 San Antonio census tracts, and 43.2% (RMSE, 6.7) for the 178 Memphis census tracts. We illustrate the linear correlation between the actual obesity prevalence and our model- estimated prevalence and compare the findings using the image features and POI data in Figure 4 and eFigure 12 in the Supplement. One possible explanation of the significant associations between the data on the built environment features and obesity prevalence is that the data might be representative of socioeconomic indicators such as income. Specifically, the variation in per capita income explained by the features of the built environment was 37.6% for the 213 Seattle census tracts, 62.1% for the 993 JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 5/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence Los Angeles census tracts, 58.2% for the 311 San Antonio census tracts, and 23.2% for the 178 Memphis census tracts (eFigures 13-16 in the Supplement). These observations suggest that for cities such as Los Angeles and San Antonio, most of the significant association between obesity prevalence and the features of the built environment could potentially be explained by variations in socioeconomic status. This suggestion agrees with a previous study that used CNNs and satellite imagery to assess the association between the built environment and poverty. However, the inconsistency in the associations across the 4 regions also suggests that features discerned by the CNN might capture additional information not directly linked to socioeconomic indicators. Figure 2. Actual Obesity Prevalence and Cross-Validated Model Estimates of Obesity Prevalence in High-Prevalence Areas A Seattle, Washington Bellevue Seattle Tacoma Bellevue Seattle Tacoma 15.2 18.1 20.9 23.8 26.6 29.5 32.3 38.0 40.9 15.2 18.1 20.9 23.8 26.6 29.5 32.3 38.0 40.9 Obesity Prevalence, % Obesity Prevalence, % B Los Angeles, California 11.3 15.4 19.6 23.7 27.8 32.0 36.1 40.2 44.4 48.5 11.3 15.4 19.6 23.7 27.8 32.0 36.1 40.2 44.4 48.5 Obesity Prevalence, % Obesity Prevalence, % Images on the right represent actual obesity prevalence; on the left, cross-validated estimates of obesity prevalence based on features of the built environment extracted from satellite images. Images from the Seattle region include Bellevue, Seattle, and Tacoma. The gray shaded regions do not have data. JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 6/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence Discussion In this study, we used a CNN, a deep learning approach, to extract data representing features of the built environment from high-resolution satellite images to examine the association between the built environment and the prevalence of obesity across 4 regions. The CNN features capture different aspects of the environment, such as greenery and different housing types, that have been associated with physical activity and obesity. Our results demonstrate a consistent association between the built environment indicator and obesity prevalence across neighborhoods with low and high prevalence of adult obesity. A sampling of the satellite images from various census tracts with high and low prevalence of obesity indicates that these environments have features that have been linked with low and high prevalence of obesity in other studies. For example, we observe neighborhoods with Figure 3. Actual Obesity Prevalence and Cross-Validated Model Estimates of Obesity Prevalence in Low-Prevalence Areas A Memphis, Tennessee 15.6 19.4 23.1 26.9 30.6 34.4 38.1 41.9 45.6 15.6 19.4 23.1 26.9 30.6 34.4 38.1 41.9 45.6 Obesity Prevalence, % Obesity Prevalence, % B San Antonio, Texas 18.9 21.8 24.8 27.7 30.6 33.6 36.5 39.4 42.4 45.3 18.9 21.8 24.8 27.7 30.6 33.6 36.5 39.4 42.4 45.3 Obesity Prevalence, % Obesity Prevalence, % Images on the right represent actual obesity prevalence; on the left, cross-validated estimates of obesity prevalence based on features of the built environment extracted from satellite images. The gray shaded regions do not have data. JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 7/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence tightly packed houses vs spaced housing (eFigure 11 in the Supplement), neighborhoods close to roadways vs those with smaller streets and crosswalks (eFigure 9 in the Supplement), neighborhoods with much vs less greenery (eFigure 7 in the Supplement), and affluent vs poor neighborhoods (eFigure 5 in the Supplement). In some instances, our models tended to underestimate obesity prevalence, especially for Seattle, which could be explained by the presence of green spaces in the region and the low variation of obesity prevalence across most of the region. Overestimation of obesity prevalence as observed in some of the high-income census tracts in the east of Memphis could be explained by the presence of features that do not encourage physical activity; however, individuals can more readily afford gym memberships and other recreational facilities. These observations therefore suggest that the features of the built environment can be used in combination with other data sources for monitoring obesity prevalence, and these data could be useful for regions with delayed updates on obesity estimates and for programs focused on reducing obesity. Our study is important for several reasons. First, although the built environment has been postulated to have an effect on obesity prevalence, studies have produced varying results, partly because measures of the built environment can be subjective, sometimes relying on participant or researcher perceptions. Second, in some studies, neighborhood audits of features of the built environment have heretofore been conducted using costly and time-consuming on-site visits or neighborhood surveys. The development of data algorithms that can automatically process satellite images to create indicators of the built environment would dramatically lower the cost and allow for Figure 4. Scatterplots of Model-Estimated and Model-Predicted Obesity Prevalence Plotted Against Actual Obesity Prevalence A Cross-validated model estimates based on built environment B Out-of-sample prediction based on built environment 60 60 50 50 40 40 30 30 20 20 10 10 0 0 0 10 20 30 40 50 0 10 20 30 40 50 Actual Obesity Prevalence, % Actual Obesity Prevalence, % C Cross-validated model estimates based on density of POI data D Out-of-sample predictions of obesity prevalence based on density of POI data 60 60 50 50 40 40 30 30 20 20 10 10 0 0 0 10 20 30 40 50 0 10 20 30 40 50 Actual Obesity Prevalence, % Actual Obesity Prevalence, % Built environment information is extracted from satellite images. POI indicates point of interest. JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 8/14 Model-Estimated Obesity Prevalence, % Model-Estimated Obesity Prevalence, % Model-Predicted Obesity Prevalence, % Model-Predicted Obesity Prevalence, % JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence investigations of the effect of place characteristics on obesity prevalence. Third, our study presents strong and consistent evidence suggesting that the built environment may be a significant indicator of obesity prevalence. The proportion of variation (R ) in obesity prevalence explained in studies that report this measure has varied. For example, the percentage of between–census tract variance explained increased from 77% to 87% when built environment variables were added to a model for BMI composed of demographic variables. However, not all the built environment features (ie, land use mix, subway density, bus stop density, intersection density) were significant. On adding built environment variables (including land-use mix, distribution of fast-food outlets, street connectivity, access to public transportation, and green and open spaces) to a model for overweight and obesity that included individual and neighborhood level covariates, Li et al observed that the model explained neighborhood-level variation of 0.981. The proportion of variation observed in our analyses could be owing to the comprehensiveness of our approach and possible confounding factors such as socioeconomic status. However, although our findings are likely to be explained at least in part by socioeconomic indicators, such as income, our analyses also suggest that the built environment features more consistently estimate obesity than per capita income across all regions. A possible explanation is that the features extracted include man-made changes to the built environment and natural features (eg, parks and forests) that might not always be indicative or particularly associated with socioeconomic status. We also demonstrate that the density of POI can also be associated with variations in obesity prevalence, but to a lesser extent when compared with results obtained using the data extracted from satellite images. The POI data displayed higher associations with obesity prevalence for regions with higher obesity prevalence (ie, San Antonio and Memphis) and less so in regions with lower obesity prevalence. Furthermore, some of the most significant variables could be directly associated with health, diet, and exercise (eg, gyms, spas, restaurants, bakeries, supermarkets, bowling alleys), whereas others might be linked to other neighborhood characteristics (eg, natural features, pet stores, recreational vehicle parks). Also, strictly restricting the data to POIs associated with health and exercise resulted in poorer results. Furthermore, the POI data support the results from the satellite imagery analysis; specifically, both analyses suggest stronger associations between the built environment and obesity for San Antonio and Memphis compared with Los Angeles and Seattle. Our findings are relevant to researchers seeking to develop low-cost and timely methods that allow for direct measurement of the built environment to study its association with obesity and other health outcomes. All the data and computational methods used in this study are openly available, allowing for comparisons of study results across regions with varying populations and geographies. In addition, our results are also relevant to people monitoring obesity prevalence or working to develop public health programs to decrease obesity. We show that models fitted solely to the features of the built environment can provide reasonable estimates of neighborhood obesity prevalence, which are typically delayed from official sources by several years. For this study, we used obesity prevalence estimates from 2014, because more recent values were unavailable. Ideally, methods and programs should focus on combining individual- and neighborhood-level data to provide timely estimates of neighborhood obesity prevalence. Limitations This study has some limitations. First, the obesity prevalence estimates from the Behavioral Risk Factor Surveillance System are based on self-reported height and weight, which have been shown to 55,56 be biased and tend to lead to lower estimates of obesity prevalence. In addition, BMI does not allow for the direct measurement of body fat, which can vary across sex, age, race, and ethnicity. Furthermore, mortality and morbidity risk may vary across different race and ethnicities at the same 57,58 BMI. Differences also occur in the timing of the obesity data and the satellite images, which can introduce bias into our analysis. JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 9/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence Furthermore, we used a CNN that is trained for object recognition to extract features relevant to the built environment because of the lack of a labeled data set for high- and low-obesity areas. This approach puts some restrictions on the interpretability of features used in our model. A CNN trained to classify land use patterns (eg, the UC [University of California] Merced land use data set ) also encodes fine-grained information about the built environment. Thus, alternatively, we can use such a CNN trained on a large land-use data set as a fixed-feature extractor and examine the performance of these vectors for classifying high- and low-obesity areas. This CNN will allow us to associate land use patterns with obesity trends and lend greater interpretability to prediction models. However, we were unable to find openly available networks pretrained on aerial and satellite imagery data sets for such a task. Google Street View images are a potential resource for capturing community activity levels at greater precision. For example, presence of persons jogging in the neighborhood might be an indicator of lower obesity prevalence in the community. Conclusions Our study provides evidence of the efficacy of CNNs at associating obesity prevalence with significant physical environment features and opens possibilities for refining the methods for a more consistent and useful application. To make our approach useful for public health and community planning efforts, our future efforts will focus on assessing disparities based on neighborhood racial composition and socioeconomic status. Socioeconomic status has been linked to obesity and other health outcomes. For example, Gordon-Larsen et al found that lower socioeconomic status blocks were more likely to have fewer physical activity facilities, and these disparities in access could be associated with differences in overweight patterns. Review studies focused on the African American population and disadvantaged populations highlighted a strong association between safety and 38,60 physical activity and obesity, especially in urban regions. This finding could also explain the associations we noted between police stations and the high obesity prevalence in the POI data. These populations were also less likely to have access to needed resources and environments typically associated with lower obesity prevalence. Furthermore, longitudinal studies using our approach can assess changes in the built environment that may be associated with increases or decreases in neighborhood obesity prevalence. Results in this study support the association between features of the built environment and obesity prevalence. Neighborhood-level interventions to encourage physical activity and increase access to healthy food outlets could be combined with individual-level interventions to aid in curbing the obesity epidemic. ARTICLE INFORMATION Accepted for Publication: May 11, 2018. Published: August 31, 2018. doi:10.1001/jamanetworkopen.2018.1535 Open Access: This is an open access article distributed under the terms of the CC-BY License. © 2018 Maharana A et al. JAMA Network Open. Corresponding Author: Elaine Okanyene Nsoesie, PhD, Institute for Health Metrics and Evaluation, University of Washington, 2301 Fifth Ave, Ste 600, Seattle, WA 98121 ([email protected]). Author Affiliations: Department of Biomedical Informatics and Medical Education, University of Washington, Seattle (Maharana); Institute for Health Metrics and Evaluation, University of Washington, Seattle (Nsoesie). Author Contributions: Both authors had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. Concept and design: Both authors. Acquisition, analysis, or interpretation of data: Both authors. Drafting of the manuscript: Both authors. JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 10/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence Critical revision of the manuscript for important intellectual content: Both authors. Statistical analysis: Both authors. Conflict of Interest Disclosures: None reported. Funding/Support: Funding for this project was provided by the Robert Wood Johnson Foundation (grant 73362). Role of the Funder/Sponsor: The funder/sponsor had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication. REFERENCES 1. Afshin A, Forouzanfar MH, Reitsma MB, et al; GBD 2015 Obesity Collaborators. Health effects of overweight and obesity in 195 countries over 25 years. N Engl J Med. 2017;377(1):13-27. doi:10.1056/NEJMoa1614362 2. Centers for Disease Control and Prevention. Overweight & obesity: adult obesity facts. October 2016. https:// www.cdc.gov/obesity/data/adult.html. Updated June 12, 2018. Accessed May 19, 2017. 3. Dwyer-Lindgren L, Freedman G, Engell RE, et al. Prevalence of physical activity and obesity in US counties, 2001-2011: a road map for action. Popul Health Metr. 2013;11:7. doi:10.1186/1478-7954-11-7 4. Hales CM, Carroll MD, Fryar CD, Ogden CL. Prevalence of obesity among adults and youth: United States, 2015– 2016. NCHS Data Brief No. 288. https://www.cdc.gov/nchs/products/databriefs/db288.htm. Updated October 13, 2017. Accessed October 14, 2017. 5. The State of Obesity. Obesity rates & trends—adult obesity in the United States. https://stateofobesity.org/rates/. Published August 2017. Accessed October 14, 2017. 6. Ghosh S, Bouchard C. Convergence between biological, behavioural and genetic determinants of obesity. Nat Rev Genet. 2017;18(12):731-748. doi:10.1038/nrg.2017.72 7. Centers for Disease Control and Prevention. About healthy places. 2007. https://www.cdc.gov/healthyplaces/ factsheets.htm. Reviewed October 21, 2016. Accessed July 14, 2018. 8. Giles-Corti B, Macintyre S, Clarkson JP, Pikora T, Donovan RJ. Environmental and lifestyle factors associated with overweight and obesity in Perth, Australia. Am J Health Promot. 2003;18(1):93-102. doi:10.4278/0890- 1171-18.1.93 9. Saelens BE, Sallis JF, Black JB, Chen D. Neighborhood-based differences in physical activity: an environment scale evaluation. Am J Public Health. 2003;93(9):1552-1558. doi:10.2105/AJPH.93.9.1552 10. Burdette HL, Whitaker RC. Neighborhood playgrounds, fast food restaurants, and crime: relationships to overweight in low-income preschool children. Prev Med. 2004;38(1):57-63. doi:10.1016/j.ypmed.2003.09.029 11. Frank LD, Andresen MA, Schmid TL. Obesity relationships with community design, physical activity, and time spent in cars. Am J Prev Med. 2004;27(2):87-96. doi:10.1016/j.amepre.2004.04.011 12. Maddock J. The relationship between obesity and the prevalence of fast food restaurants: state-level analysis. Am J Health Promot. 2004;19(2):137-143. doi:10.4278/0890-1171-19.2.137 13. Ellaway A, Macintyre S, Bonnefoy X. Graffiti, greenery, and obesity in adults: secondary analysis of European Cross Sectional Survey. BMJ. 2005;331(7517):611-612. doi:10.1136/bmj.38575.664549.F7 14. Rutt CD, Coleman KJ. Examining the relationships among built environment, physical activity, and body mass index in El Paso, TX. Prev Med. 2005;40(6):831-841. doi:10.1016/j.ypmed.2004.09.035 15. Sturm R, Datar A. Body mass index in elementary school children, metropolitan area food prices and food outlet density. Public Health. 2005;119(12):1059-1068. doi:10.1016/j.puhe.2005.05.007 16. Lopez-Zetina J, Lee H, Friis R. The link between obesity and the built environment: evidence from an ecological analysis of obesity and vehicle miles of travel in California. Health Place. 2006;12(4):656-664. doi:10.1016/j. healthplace.2005.09.001 17. Ewing R, Schmid T, Killingsworth R, Zlot A, Raudenbush S. Relationship between urban sprawl and physical activity, obesity and morbidity. Am J Health Promot. 2003;18(1):47-57. doi:10.4278/0890-1171-18.1.47 18. Nelson MC, Gordon-Larsen P, Song Y, Popkin BM. Built and social environments associations with adolescent overweight and activity. Am J Prev Med. 2006;31(2):109-117. doi:10.1016/j.amepre.2006.03.026 19. Morland K, Diez Roux AV, Wing S. Supermarkets, other food stores, and obesity: the Atherosclerosis Risk in Communities Study. Am J Prev Med. 2006;30(4):333-339. doi:10.1016/j.amepre.2005.11.003 20. Mobley LR, Root ED, Finkelstein EA, Khavjou O, Farris RP, Will JC. Environment, obesity, and cardiovascular disease risk in low-income women. Am J Prev Med. 2006;30(4):327-332. doi:10.1016/j.amepre.2005.12.001 21. Gordon-Larsen P, Nelson MC, Page P, Popkin BM. Inequality in the built environment underlies key health disparities in physical activity and obesity. Pediatrics. 2006;117(2):417-424. doi:10.1542/peds.2005-0058 JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 11/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence 22. Doyle S, Kelly-Schwartz A, Schlossberg M, Stockard J. Active community environments and health: the relationship of walkable and safe communities to individual health. J Am Plann Assoc. 2006;72(1):19-31. doi:10.1080/ 23. Ewing R, Brownson RC, Berrigan D. Relationship between urban sprawl and weight of United States youth. Am J Prev Med. 2006;31(6):464-474. doi:10.1016/j.amepre.2006.08.020 24. Lopez R. Urban sprawl and risk for being overweight or obese. Am J Public Health. 2004;94(9):1574-1579. doi:10. 2105/AJPH.94.9.1574 25. Kelly-Schwartz AC, Stockard J, Doyle S, Schlossberg M. Is sprawl unhealthy? a multilevel analysis of the relationship of metropolitan sprawl to the health of individuals. J Plann Educ Res. 2004;24(2):184-196. doi:10.1177/ 0739456X04267713 26. Inagami S, Cohen DA, Finch BK, Asch SM. You are where you shop: grocery store locations, weight, and neighborhoods. Am J Prev Med. 2006;31(1):10-17. doi:10.1016/j.amepre.2006.03.019 27. Booth KM, Pinkston MM, Poston WSC. Obesity and the built environment. J Am Diet Assoc. 2005;105 (5)(suppl 1):S110-S117. doi:10.1016/j.jada.2005.02.045 28. Black JL, Macinko J. Neighborhoods and obesity. Nutr Rev. 2008;66(1):2-20. doi:10.1111/j.1753-4887. 2007.00001.x 29. Papas MA, Alberg AJ, Ewing R, Helzlsouer KJ, Gary TL, Klassen AC. The built environment and obesity. Epidemiol Rev. 2007;29(1):129-143. doi:10.1093/epirev/mxm009 30. Li F, Harmer P, Cardinal BJ, et al. Built environment and 1-year change in weight and waist circumference in middle-aged and older adults: Portland Neighborhood Environment and Health Study. Am J Epidemiol. 2009;169 (4):401-408. doi:10.1093/aje/kwn398 31. Ferreira I, van der Horst K, Wendel-Vos W, Kremers S, van Lenthe FJ, Brug J. Environmental correlates of physical activity in youth—a review and update. Obes Rev. 2007;8(2):129-154. doi:10.1111/j.1467-789X. 2006.00264.x 32. Babey SH, Hastert TA, Yu H, Brown ER. Physical activity among adolescents: when do parks matter? Am J Prev Med. 2008;34(4):345-348. doi:10.1016/j.amepre.2008.01.020 33. Reed J, Ainsworth B. Perceptions of environmental supports on the physical activity behaviors of university men and women: a preliminary investigation. J Am Coll Health. 2007;56(2):199-204. doi:10.3200/JACH. 56.2.199-208 34. McCormack G, Giles-Corti B, Lange A, Smith T, Martin K, Pikora TJ. An update of recent evidence of the relationship between objective and self-report measures of the physical environment and physical activity behaviours. J Sci Med Sport. 2004;7(1)(suppl):81-92. doi:10.1016/S1440-2440(04)80282-2 35. Kaczynski AT, Henderson KA. Parks and recreation settings and active living: a review of associations with physical activity function and intensity. J Phys Act Health. 2008;5(4):619-632. doi:10.1123/jpah.5.4.619 36. Rundle A, Neckerman KM, Freeman L, et al. Neighborhood food environment and walkability predict obesity in New York City. Environ Health Perspect. 2009;117(3):442-447. doi:10.1289/ehp.11590 37. Jeffery RW, Baxter J, McGuire M, Linde J. Are fast food restaurants an environmental risk factor for obesity? Int J Behav Nutr Phys Act. 2006;3(1):2. doi:10.1186/1479-5868-3-2 38. Lovasi GS, Hutson MA, Guerra M, Neckerman KM. Built environments and obesity in disadvantaged populations. Epidemiol Rev. 2009;31(1):7-20. doi:10.1093/epirev/mxp005 39. Nguyen QC, Sajjadi M, McCullough M, et al. Neighbourhood looking glass: 360º automated characterisation of the built environment for neighbourhood effects research. J Epidemiol Community Health. 2018;72(3): 260-266. doi:10.1136/jech-2017-209456 40. Centers for Disease Control and Prevention. 500 Cities: Local Data for Better Health. https://www.cdc.gov/ 500cities/. Updated November 28, 2017. Accessed September 13, 2017. 41. Centers for Disease Control and Prevention. 500 Cities: Local Data for Better Health. Unhealthy Behaviors. https:// www.cdc.gov/500cities/definitions/unhealthy-behaviors.htm. Updated December 12, 2016. Accessed September 13, 2017. 42. Esteva A, Kuprel B, Novoa RA, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017;542(7639):115-118. doi:10.1038/nature21056 43. Jean N, Burke M, Xie M, Davis WM, Lobell DB, Ermon S. Combining satellite imagery and machine learning to predict poverty. Science. 2016;353(6301):790-794. doi:10.1126/science.aaf7894 JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 12/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence 44. Sharif Razavian A, Azizpour H, Sullivan J, Carlsson S. CNN features off-the-shelf: an astounding baseline for recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops; Columbus, OH; 2014:806-813. 45. Chatfield K, Simonyan K, Vedaldi A, Zisserman A. Return of the devil in the details: delving deep into convolutional nets. https://arxiv.org/abs/1405.3531. Revised November 5, 2014. Accessed July 14, 2018. 46. ImageNet. http://www.image-net.org. Accessed May 9, 2018. 47. ImageNet Large Scale Visual Recognition Challenge 2014 (ILSVRC2014). http://image-net.org/challenges/ LSVRC/2014/browse-synsets. Accessed May 8, 2018. 48. Marmanis D, Datcu M, Esch T, Stilla U. Deep learning earth observation classification using ImageNet pretrained networks. IEEE Geosci Remote Sens Lett. 2016;13(1):105-109. doi:10.1109/LGRS.2015.2499239 49. Cheng G, Han J, Lu X. Remote sensing image scene classification: benchmark and state of the art. Proc IEEE. 2017;105(10):1865-1883. doi:10.1109/JPROC.2017.2675998 50. Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L. Imagenet: A large-scale hierarchical image database. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition; Miami Beach, FL; 2009: 248-255. 51. Friedman J, Hastie T, Tibshirani R. The Elements of Statistical Learning: Vol 1: Springer Series in Statistics. Berlin, Germany: Springer; 2001. 52. US Census Bureau. American Fact Finder: 2014 American Community Survey 5-Year Estimates. Table B19301 generated by Adyasha Maharana. http://factfinder2.census.gov. Accessed June 16, 2017. 53. Rundle A, Roux AV, Freeman LM, Miller D, Neckerman KM, Weiss CC. The urban built environment and obesity in New York City: a multilevel analysis. Am J Health Promot. 2007;21(4)(suppl):326-334. doi:10.4278/0890-1171- 21.4s.326 54. Li F, Harmer PA, Cardinal BJ, et al. Built environment, adiposity, and physical activity in adults aged 50-75. Am J Prev Med. 2008;35(1):38-46. doi:10.1016/j.amepre.2008.03.021 55. Kuczmarski MF, Kuczmarski RJ, Najjar M. Effects of age on validity of self-reported height, weight, and body mass index: findings from the Third National Health and Nutrition Examination Survey, 1988-1994. J Am Diet Assoc. 2001;101(1):28-34. doi:10.1016/S0002-8223(01)00008-6 56. Merrill RM, Richardson JS. Validity of self-reported height, weight, and body mass index: findings from the National Health and Nutrition Examination Survey, 2001-2006. Prev Chronic Dis. 2009;6(4):A121. 57. Zheng W, McLerran DF, Rolland B, et al. Association between body-mass index and risk of death in more than 1 million Asians. N Engl J Med. 2011;364(8):719-729. doi:10.1056/NEJMoa1010679 58. Jafar TH, Islam M, Poulter N, et al. Children in South Asia have higher body mass-adjusted blood pressure levels than white children in the United States: a comparative study. Circulation. 2005;111(10):1291-1297. doi:10.1161/ 01.CIR.0000157699.87728.F1 59. Yang Y, Newsam S. Bag-of-visual-words and spatial extensions for land-use classification. In: Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems; New York, NY; 2010:270-279. 60. Casagrande SS, Whitt-Glover MC, Lancaster KJ, Odoms-Young AM, Gary TL. Built environment and health behaviors among African Americans: a systematic review. Am J Prev Med. 2009;36(2):174-181. doi:10.1016/j.amepre. 2008.09.037 SUPPLEMENT. eMethods. Materials and Data Analysis eTable 1. Places of Interest eTable 2. Demographic and Obesity Data eFigure 1. Illustration of Transfer Learning Approach eFigure 2. VGG-CNN-F Convolutional NN Architecture eFigure 3. t-SNE Visualization of Features Extracted From VGG-CNN-F for Satellite Imagery eFigure 4. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence eFigure 5. Google Satellite Images for Seattle Showing Locations With Low and High Obesity Prevalence, Respectively eFigure 6. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence eFigure 7. Google Satellite Images for San Antonio Showing Locations With Low and High Obesity Prevalence, Respectively eFigure 8. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 13/14 JAMA Network Open | Health Informatics Deep Learning to Examine the Built Environment and Neighborhood Adult Obesity Prevalence eFigure 9. Google Satellite Images for Memphis Showing Locations With Low and High Obesity Prevalence, Respectively eFigure 10. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence eFigure 11. Google Satellite Images for Los Angeles Showing Locations With Low and High Obesity Prevalence, Respectively eFigure 12. Out-of-Sample Predictions of Obesity Prevalence Plotted Against Actual Obesity Prevalence eFigure 13. Cross-validated Model Estimates of Per Capita Income Plotted Against Actual Per Capita Income eFigure 14. Out-of-Sample Model Predictions of Per Capita Income Plotted Against Actual Per Capita Income eFigure 15. Actual Per Capita Income and Cross-validated Model Estimates of Per Capita Income eFigure 16. Actual Per Capita Income and Cross-validated Model Estimates of Per Capita Income eReferences. JAMA Network Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 August 31, 2018 14/14 Supplementary Online Content Maharana A, Nsoesie EO. Use of deep learning to examine the association of the built environment with prevalence of neighborhood adult obesity. JAMA Netw Open. 2018;1(4):e181535. doi:10.1001/jamanetworkopen.2018.1535 eMethods. Materials and Data Analysis eTable 1. Places of Interest eTable 2. Demographic and Obesity Data eFigure 1. Illustration of Transfer Learning Approach eFigure 2. VGG-CNN-F Convolutional NN Architecture eFigure 3. t-SNE Visualization of Features Extracted From VGG-CNN-F for Satellite Imagery eFigure 4. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence eFigure 5. Google Satellite Images for Seattle Showing Locations With Low and High Obesity Prevalence, Respectively eFigure 6. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence eFigure 7. Google Satellite Images for San Antonio Showing Locations With Low and High Obesity Prevalence, Respectively eFigure 8. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence eFigure 9. Google Satellite Images for Memphis Showing Locations With Low and High Obesity Prevalence, Respectively eFigure 10. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence eFigure 11. Google Satellite Images for Los Angeles Showing Locations With Low and High Obesity Prevalence, Respectively eFigure 12. Out-of-Sample Predictions of Obesity Prevalence Plotted Against Actual Obesity Prevalence eFigure 13. Cross-validated Model Estimates of Per Capita Income Plotted Against Actual Per Capita Income eFigure 14. Out-of-Sample Model Predictions of Per Capita Income Plotted Against Actual Per Capita Income eFigure 15. Actual Per Capita Income and Cross-validated Model Estimates of Per Capita Income eFigure 16. Actual Per Capita Income and Cross-validated Model Estimates of Per Capita Income eReferences. This supplementary material has been provided by the authors to give readers additional information about their work. © Maharana A et al. JAMA Network Open. © Maharana A et al. JAMA Network Open. eMethods. Materials and Data Analysis Data on Obesity Prevalence We obtained 2014 estimates on annual crude obesity prevalence from the 500 cities project . We used crude obesity prevalence estimates from 2014 because adjusted estimates and more recent annual estimates were unavailable. These obesity estimates are derived from data from the United States Centers for Disease Control and Prevention (CDC) Behavioral Risk Factor Surveillance System (BRFSS) . BRFSS is an annual cross-sectional telephone survey used to measure behavioral risk factors of U.S. residents. The BRFSS includes all 50 U.S. states, the District of Columbia and U.S. territories (i.e., Puerto Rico, Guam and the Virgin Islands). An individual is considered obese if their body mass index (BMI) is greater than or equal to 30 kg/m , where BMI is estimated from self-reported weight and height. Certain exclusions were applied, including removal of pregnant women and individuals reporting certain height and weight measurements . The data includes respondents 18 years of age and older. To illustrate our approach, we focused on estimates of obesity prevalence for census tracts in six cities – Los Angeles, California; Memphis, Tennessee; San Antonio, Texas; and Seattle, Tacoma and Bellevue, Washington. Because Seattle, Tacoma and Bellevue are neighboring cities with few census tracts, we combined their data into a single dataset, referred to as, Seattle. These cities were selected to reflect regions with varying obesity prevalence. Recent rankings of obesity prevalence by states lists Tennessee and Texas as sixth and eighth of fifty most obese © Maharana A et al. JAMA Network Open. 4 states, respectively . In contrast, the states of Washington and California have lower obesity rates and are ranked thirty-second and forty-seventh of fifty, respectively. Satellite Imagery and Places of Interest Data We downloaded recent satellite images for each census tract at the zoom level of 18 and image dimensions of 400 by 400 pixels (which was later resized to 224 x 224 for our analysis) from the freely available Google Static Maps API. Historical data matching the time period of the obesity estimates was unavailable. The satellite imagery data consisted of approximately 150,000 images, implying there were multiple images for each census tract. We also obtained a comprehensive list of places of interest (e.g., parks, restaurants, liquor stores, bus stations, night clubs) by performing a nearby search for each location in a square grid spanning a census tract, using the Google Nearby Search API. We included all categories of places of interest available through the API instead of focusing only on physical activity facilities, food and health locations which have been widely studied, because we reasoned that other categories could influence an individual’s health behavior and physical activity frequency. For example, a high density of pet stores could indicate high pet ownership which could influence how often people go to parks and take walks around the neighborhood. Furthermore, the places of interest data varied by city, with some cities, such as Los Angeles, having finer classifications than others. The data was further cleaned to avoid duplicate counts of the same location. A comprehensive list of places of interest categories are included in the eTable 1. © Maharana A et al. JAMA Network Open. Deep Neural Network Model Data extracted from satellite images have been used in several health-related applications ranging from infectious disease monitoring to estimating socioeconomic indicators such as, 5-7 poverty . To extract information from the 150,000 satellite images, we used a convolutional neural network (CNN), which is the state-of-art method for most computer vision tasks such as object recognition, scene labelling, image segmentation and pose estimation. More recently, CNNs have also been trained in image recognition for skin cancer, diagnosis of plant diseases 8-10 and classification of urban landscapes . To train a CNN from scratch to differentiate between regions with low and high obesity prevalence, we need a large corpus consisting of millions of labelled images. However, such training data is unavailable. Instead, we used a transfer learning approach (see eFigure 1), which involves applying a previously trained network to our dataset of unlabeled images to make inferences. We used a network, VGG-CNN-F, which has been previously trained (hereafter referred to as, pre-trained) for object recognition on the ILSVRC-2012 challenge dataset (subset 11,14 of ImageNet database) and is freely available to the research community . The trained network achieved 16.7% top-5 error on the challenge test set. This implies that the network is able to identify the correct class for every image within its top 5 predictions or predictions with highest probability. The network has helped achieve breakthroughs in other image recognition tasks through transfer learning. The convolutional neural network consists of five convolutional layers and three fully connected layers. Each convolutional layer is composed of several two-dimensional filters which activate © Maharana A et al. JAMA Network Open. the features required for classifying an object correctly. During training, the neural network learns to extract gradients, edges and patterns that aid in accurate object detection. The fully connected layers further process these features and convert them into single dimensional vectors. The output layer (final fully connected layer) is originally designed for classifying between 1000 object categories. Essentially, this neural network transforms a large two-dimensional image into a single vector of fixed dimension, containing only the most important descriptors of the image. This feature vector is extracted by deploying the network and making a forward pass through it for each image. It has been shown that these descriptors can be used with linear classifiers or regression models to perform tasks that are much different from object recognition . We employ this technique to transform satellite images of dimension 224x224 into a feature vector of length 4096, taken from the seventh hidden layer or second fully-connected layer of the VGG-CNN-F network. We also make a forward pass through the network for some images and examine the output maps from convolutional layers of the CNN, to see if built environment features are being highlighted by these filters and transmitted to the succeeding layers. The output maps are single channel images which can be plotted and compared to the original image for interpretation of the outputs. The results from this process are visualized in Figure 1. To investigate whether our approach could differentiate between images from areas with low and high obesity prevalence, we extracted the 4096-dimensional feature vectors from the second fully connected layer of VGG-CNN-F for each image and projected these vectors onto a 2- dimensional space via t-distributed stochastic neighbor embedding (t-SNE). t-SNE is a non- © Maharana A et al. JAMA Network Open. linear dimensionality reduction algorithm which has been known to preserve local neighborhoods at the expense of global structure . It can be implemented using various metrics for measuring distance between two vectors; we used the Euclidean distance. We also fit a 3- component Gaussian mixture model to tract-level obesity prevalence data in order to divide census tracts of a city into low, and high obesity areas. All images belonging to a high obesity census tract were tagged as high obesity areas for the purpose of visualization, similarly for low obesity census tracts. Further, the mean feature vector for a census tract is computed by taking the average of the vectors for all satellite images belonging to that particular census tract. We observe well-formed clusters for low and high obesity prevalence in the visualization for San Antonio, showing that built environment features in these areas are distinct (eFigure 3). The Gaussian distribution means for census tracts in San Antonio with low and high obesity prevalence are 26.7 and 38.4, respectively. In contrast, the Gaussian distribution means for census tracts in Seattle-Tacoma-Bellevue with low and high obesity prevalence are 20.6 and 31.9 only, making Seattle an area with low obesity prevalence. This is reflected in the t-SNE visualization for Seattle - the high obesity labelled data points are outnumbered and do not form distinct clusters within themselves. The clustering of images in eFigure 3, demonstrates that our approach can differentiate between images from census tracts with low and high obesity prevalence. Regression Modeling We conducted three different sets of analyses. Our first aim was to quantify the association between the features of the built environment and obesity prevalence at the census tract level. © Maharana A et al. JAMA Network Open. We assessed how much of the variation in obesity prevalence across all census tracts is explained by features of the built environment extracted from satellite images. Since the data contained a large number of features (n=4,096), we applied Elastic Net – a regularization and variable selection technique. A major benefit of Elastic Net is that it combines the advantages of Ridge regression and Least Absolute Shrinkage and Selection Operator (LASSO); insignificant covariates are eliminated, while correlated variables that are significant are maintained . Next, we evaluated how well our model predicts obesity prevalence across all cities by splitting the data into two sets – a random sample representing sixty percent of the data for fitting and the remaining forty percent for model evaluation. Our second aim was to quantify the association between the neighborhood density of places of interest (such as, fast food outlets and recreational facilities) to obesity prevalence. We used the same process for variable selection and regression modeling as previously described. We then compared the model coefficient of determination (R ), root mean squared error (RMSE) and Pearson correlation between the actual and estimated observations to our previous results on estimating obesity using data on the built environment extracted from satellite images. Lastly, we fit separate models to quantify how well the features of the built environment extracted from satellite images predict socioeconomic variables, such as per capita income. This analysis was undertaken because obesity prevalence tends to correlate with socioeconomic status and we wanted to investigate if this could partially explain the performance of our models in predicting obesity prevalence. © Maharana A et al. JAMA Network Open. 16 We used the R statistical software for all the regression modeling . Five-fold cross validation was applied to models for all regions except Memphis, for which we used a three-fold cross validation because the sample size was less than 200, which limits the number of data points used in each fold. To prevent over-fitting, we also limited the number of selected features to be less than or equal to the number of census of tracts. © Maharana A et al. JAMA Network Open. eTable 1. Places of Interest Accounting Car Wash Gas Station Lodging Police Airport Casino General Contractor Meal Delivery Post Office Amusement Grocery or Real Estate Cemetery Meal Takeaway Park Supermarket Agency Aquarium Church Gym Mosque Restaurant Roofing Art Gallery City Hall Hair Care Movie Rental Contractor ATM Clothing Store Hardware Store Movie Theater RV Park Convenience Moving Bakery Health School Store Company Bank Courthouse Hindu Temple Museum Shoe Store Bar Dentist Home Goods Store Natural Feature Shopping Mall Department Beauty Salon Hospital Neighborhood Spa Store Bicycle Store Doctor Insurance Agency Night Club Stadium Book Store Electrician Jewelry Store Painter Storage Electronics Bowling Alley Laundry Park Synagogue Store Bus Station Embassy Lawyer Parking Taxi Stand Cafe Finance Library Pet Store Train Station Campground Fire Station Light Rail Station Pharmacy Transit Station Car Dealer Florist Liquor Store Physiotherapist Travel Agency © Maharana A et al. JAMA Network Open. Local Government Place of Car Rental Funeral Home University Office Worship Car Repair Furniture Store Locksmith Plumber Veterinary Care Zoo © Maharana A et al. JAMA Network Open. eTable 2. Demographic and Obesity Data Bellevue Seattle Tacoma Los Memphis San Angeles Antonio 2010 122,363 608,660 198,397 3,792,621 646,889 1,327,407 Population Median age 38.5 36.1 35.1 34.1 33.0 32.7 (years) 18 years and 96,410 515,147 152,760 2,918,096 478,921 971,407 over (78.8%) (84.6%) (77.0%) (76.9%) (74.0%) (73.2%) Male 48,020 256,561 74,539 1,441,341 221,992 466,126 (39.2%) (42.2%) (37.6%) (38.0%) (34.3%) (35.1%) Female 48,390 258,586 78,221 1,476,755 256,929 505,281 (39.5%) (42.5%) (39.4%) (38.9%) (39.7%) (38.1%) Income (per 50,405 44,167 26,805 28,320 21,909 22,784 capita, 2014)* Obesity Crude 16.3 (95% 19.2 (95% 24.6 (95% 21.1 (95% 29.3 (95% 32.4 (95% prevalence CI, 16.2- CI, 19.2- CI, 24.6- CI, 21.1- CI, 29.2- CI, 32.4- 16.3) 19.3) 24.7) 21.1) 29.4) 32.5) Age-adjusted 18.8 (95% 22.4 (95% 30.8 (95% 26.7 (95% 36.3 (95% 32.9 (95% prevalence CI, 18.6- CI, 22.3- CI, 30.6- CI, 26.7- CI, 36.2- CI, 32.8- 18.9) 22.5) 31) 26.8) 36.5) 32.9) © Maharana A et al. JAMA Network Open. *American Community Survey, 2014 5-year estimate. eFigure 1. Illustration of Transfer Learning Approach Let classification of objects from the ImageNet dataset be Domain A. The model trained for domain A has learnt how to interpret an image. We use this knowledge acquired by the model to understand satellite images. When the satellite images are fed as input to the model, we receive feature maps that encode this knowledge. These feature maps are used in a regression model to predict obesity values which is domain B. (Left composite image courtesy of ImageNet: http://image-net.org/) © Maharana A et al. JAMA Network Open. eFigure 2. VGG-CNN-F Convolutional NN Architecture We have used the 8-layer VGG-CNN-F convolutional neural network for our project. In this architecture, the first five layers are a five-fold repetition of this arrangement: <convolutional layer->pooling layer->ReLU layer> Those layers are represented in blue. The next 3 layers (represented in green) are fully connected layers. (Left composite image courtesy of ImageNet: http://image-net.org/) © Maharana A et al. JAMA Network Open. eFigure 3. t-SNE Visualization of Features Extracted From VGG-CNN-F for Satellite Imagery (a) San Antonio, Texas and (b) Seattle-Tacoma-Bellevue area, Washington. © Maharana A et al. JAMA Network Open. eFigure 4. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence (a) actual and (b) cross-validated estimates of obesity prevalence for Bellevue (i), Seattle (ii) and Tacoma (iii), Washington based on the density of places of interest data. These cities are collectively referred to as Seattle in the manuscript. We do not have data for the gray shaded regions. © Maharana A et al. JAMA Network Open. eFigure 5. Google Satellite Images for Seattle Showing Locations With Low and High Obesity Prevalence, Respectively (Left grouping) Census tracts containing affluent neighborhoods (close to waterfront, swimming pool in apartment complex), waterfront businesses and downtowns are classified as low obesity areas in Seattle-Tacoma-Bellevue area. (Right grouping) Census tracts with small population density and less urbanized footprint are classified as high obesity areas in Seattle-Tacoma- Bellevue area. © Maharana A et al. JAMA Network Open. eFigure 6. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence (a) actual and (b) cross-validated estimates of obesity prevalence for San Antonio, Texas based on the density of places of interest data. Unlike Seattle, the places of interest data appear to capture the variability in obesity prevalence across census tracts. We do not have data for the gray shaded regions. © Maharana A et al. JAMA Network Open. eFigure 7. Google Satellite Images for San Antonio Showing Locations With Low and High Obesity Prevalence, Respectively (Top Row) Census Tract 48029190400 – Images show dense green cover in residential areas and school building. (Bottom Row) Census Tract 48029130402 –The neighborhoods are dominated by larger roadways. © Maharana A et al. JAMA Network Open. eFigure 8. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence (a) actual and (b) cross-validated estimates of obesity prevalence for Memphis, Tennessee based on the density of places of interest data. The places of interest data capture the variability in neighborhood obesity prevalence for Memphis much better than it does for Seattle and Los Angeles (below), where the overall obesity prevalence is lower. We do not have data for the gray shaded regions. © Maharana A et al. JAMA Network Open. eFigure 9. Google Satellite Images for Memphis Showing Locations With Low and High Obesity Prevalence, Respectively (Top Row) Census Tract 47157009600 – Images show green cover in neighborhood and walkways in field. (Bottom Row) Census Tract 47157000800 – Industrial area, presence of vehicles and sparse greenery. © Maharana A et al. JAMA Network Open. eFigure 10. Actual Obesity Prevalence and Cross-validated Model Estimates of Obesity Prevalence (a) actual and (b) cross-validated estimates of obesity prevalence for Los Angeles, California based on the density of places of interest data. © Maharana A et al. JAMA Network Open. eFigure 11. Google Satellite Images for Los Angeles Showing Locations With Low and High Obesity Prevalence, Respectively (Left grouping) High obesity census tracts are characterized by densely packed neighborhoods and less greenness. (Right grouping) Low obesity census tracts consist of mostly residential areas with more street greenness. © Maharana A et al. JAMA Network Open. eFigure 12. Out-of-Sample Predictions of Obesity Prevalence Plotted Against Actual Obesity Prevalence Obesity prevalence plotted against (a) Los Angeles, California (b) Memphis, Tennessee (c) San Antonio, Texas, (d) Seattle, Washington based on places of interest data. © Maharana A et al. JAMA Network Open. eFigure 13. Cross-validated Model Estimates of Per Capita Income Plotted Against Actual Per Capita Income Actual per capita income for (a) Los Angeles, California (b) San Antonio, Texas (c) Seattle, Washington (d) Memphis, Tennessee © Maharana A et al. JAMA Network Open. eFigure 14. Out-of-Sample Model Predictions of Per Capita Income Plotted Against Actual Per Capita Income Plotted against actual per capita income for (a) Los Angeles, California (b) Memphis, Tennessee (c) San Antonio, Texas and (d) Seattle, Washington © Maharana A et al. JAMA Network Open. eFigure 15. Actual Per Capita Income and Cross-validated Model Estimates of Per Capita Income © Maharana A et al. JAMA Network Open. (a), (c) actual and (b), (d) cross-validated estimates of per capita income for San Antonio, Texas and Los Angeles, California respectively, based on the features extracted from satellite images. The unit is in thousands of dollars. © Maharana A et al. JAMA Network Open. eFigure 16. Actual Per Capita Income and Cross-validated Model Estimates of Per Capita Income (a), (c) actual and (b), (d) cross-validated estimates of per capita income for Bellevue (i), Seattle (ii) and Tacoma (iii), Washington, and Memphis, Tennessee respectively, based on the features extracted from satellite images. The unit is in thousands of dollars. © Maharana A et al. JAMA Network Open. eReferences 1. CDC (2017) 500 Cities: Local Data for Better Health. Available at: https://www.cdc.gov/500cities/ [Accessed September 13, 2017]. 2. CDC (2016) Overweight & Obesity: Adult Obesity Facts. Available at: https://www.cdc.gov/obesity/data/adult.html [Accessed May 19, 2017]. 3. CDC (2016) 500 Cities: Local Data for Better Health. Unhealthy Behaviors. Available at: https://www.cdc.gov/500cities/definitions/unhealthy-behaviors.htm [Accessed September 13, 2017]. 4. The State of Obesity. Obesity Rates & Trends - Adult Obesity in the United States (2017) Available at: https://stateofobesity.org/rates/ [Accessed October 14, 2017]. 5. Nsoesie EO, Butler P, Ramakrishnan N, Mekaru SR, Brownstein JS (2015) Monitoring Disease Trends using Hospital Traffic Data from High Resolution Satellite Imagery: A Feasibility Study. Sci Rep 5:9112. 6. Karnieli A, Gilad U, Ponzet M, Svoray T, Mirzadinov R, Fedorina O (2008) Assessing land-cover change and degradation in the Central Asian deserts using satellite image processing and geostatistical methods. J Arid Environ 72(11):2093–2105. 7. Jean N, Burke M, Xie M, Davis WM, Lobell DB, Ermon S (2016) Combining satellite imagery and machine learning to predict poverty. Science 353(6301):790–794. 8. Mohanty SP, Hughes DP, Salathé M (2016) Using deep learning for image-based plant disease detection. Front Plant Sci 7. 9. Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, Thrun S (2017) Dermatologist-level classification of skin cancer with deep neural networks. Nature © Maharana A et al. JAMA Network Open. 542(7639):115–118. 10. Albert A, Kaur J, Gonzalez M (2017) Using convolutional networks and satellite imagery to identify patterns in urban environments at a large scale. ArXiv Prepr ArXiv170402965. 11. Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L (2009) Imagenet: A large-scale hierarchical image database. Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on (IEEE), pp 248–255. 12. Sharif Razavian A, Azizpour H, Sullivan J, Carlsson S (2014) CNN features off-the- shelf: an astounding baseline for recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp 806–813. 13. Friedman J, Hastie T, Tibshirani R (2001) The elements of statistical learning (Springer series in statistics Springer, Berlin). 14. Chatfield K, Simonyan K, Vedaldi A, Zisserman A (2014) Return of the devil in the details: Delving deep into convolutional nets. ArXiv Prepr ArXiv 14053531. 15. Maaten L van der, Hinton G. Visualizing data using t-SNE. J Mach Learn Res. 2008;9(Nov):2579–2605. 16. R Core Team (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.R-project.org/. © Maharana A et al. JAMA Network Open.
JAMA Network Open – American Medical Association
Published: Aug 31, 2018
You can share this free article with as many people as you like with the url below! We hope you enjoy this feature!
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.