| title | subtitle | size_bytes_ |
|---|---|---|
| 1k Pharmaceutical Pill Image Dataset | Speckled pills CNN feature extracted for unique individual identification | 7641599 |
| 2017 census data for 4chan's fitness board | Greetings /fit/izens! Do you even lift? | 118761 |
| 2018 calorie, exercise and weight changes | Daily calorie intake and exercise information with daily weight loss/gain | 1984 |
| 30000+ healthcare jobs from eMedCareers (Europe) | This dataset contains 30000 latest job postings in Europe from eMedCareers. | 23373744 |
| 40,000+ Healthcare Job Postings from eMedCareers | This dataset contains 40,000 latest job postings in Europe from eMedCareers. | 37519399 |
| 6 Months Daily Diabetes Measures | Glucose, insulin and other data from a single diabetes patient | 10891 |
| Activity recognition with healthy older people | 645405 | |
| Add Health Historical Data - 1993/4 Sample | A sample of the public data from AddHealth historical data-sets | 7083396 |
| Adverse Food Events | 90,000 product-related user-reported adverse medical events | 3326343 |
| Air pollutant PM2.5 and PM10 of India | Useful for analysis of health effect. | 997403 |
| Air pollution in Skopje from 2008 to 2018 | Skopje is one of the most polluted cities in the world recently. Here's the data | 6639421 |
| Air quality data from extensive network of sensors | PM1, PM2.5, PM10, temp, pres and hum data for 2017 year from Krakow, Poland | 2391592 |
| Alcohol and Drug Consumption of German Teens | Do modern teens prefer weed over cigarettes? | 4823 |
| Animal Bites | Data on over 9,000 bites, including rabies tests | 100280 |
| Annual HIV Deaths | 3557 | |
| ASD-ChildrenBlood-GeneExpressionData | 782 | |
| Autism Screening | Classifying autistic patients based upon the screening results. | 7474 |
| Automatic generation of Guard roles | lets build an primer for an automated guard distribution system | 65154 |
| Bad teeth, sugar and government health spending | Thanks to Gapminder Data | 114422 |
| Behavioral Risk Factor Surveillance System | Public health surveys of 400k people from 2011-2015 | 401719076 |
| Bioassay Datasets | 21 assays from PubChem that measure compound activity | 50143544 |
| Biometrics for stress monitoring | the influence of person-specific stress predictive models | 2777059337 |
| BLE RSSI Accelerometer Indoor Measurements | Indoor RSSI + accelerometer measurements with detailed location annotations | 35445793 |
| BMI_6334 Sample Diab Ret | Diabetic Retinopathy Sample | 16830018893 |
| BosLab Backyard Microbiome | BosLab analyzes local soil samples; runs NextGen (?) Seq to I.D. microbiome | 810347470 |
| Brain MRI segmentation | Brain MRI images together with manual FLAIR abnormality segmentation masks | 350584108 |
| Breast Cancer Proteomes | Dividing breast cancer patients into separate sub-classes | 5680320 |
| Breast Cancer Wisconsin - Data Set | 49928 | |
| Breast Cancer Wisconsin (Diagnostic) Data Set | Predict whether the cancer is benign or malignant | 49196 |
| breast sizes | breast sizes to ease the search for the perfect bra | 323122 |
| BRFSS 2001-2010 | Behavioral Risk Factor Surveillance System for 2001-2010 | 574779431 |
| California DDS Expenditures | Exploring Simpson's Paradox | 9594 |
| California Kindergarten Immunization Rates | How many new students contributed to herd immunity between 2000 and 2015? | 1322112 |
| Cancer Inhibitors | Predict small molecules' activity targeting protein kinase | 108435307 |
| Cancerlectins | CancerLectin and Non Cancerlectin Data in these files | 164594 |
| Cannabis Strains | Marijuana strain dataset | 424888 |
| Cardiovascular Disease dataset | The dataset consists of 70 000 records of patients data, 11 features + target. | 738432 |
| CAT Scan Localization | 384 features extracted from CT images | 18173898 |
| CDC Data: Nutrition, Physical Activity, & Obesity | Obesity Trends in US | 1225167 |
| Cervical Cancer Risk Classification | prediction of cancer indicators; Please download; run kernel & upvote | 7944 |
| ChEMBL EBI Small Molecules Database | A large-scale bioactivity database for drug discovery (BigQuery) | 16780273015 |
| Chemical Substance Registry (CAS registry numbers) | The EPA's Toxic Substances Control Act Chemical Substance Inventory | 2457467 |
| Chest X-Rays Dataset | Contains folder wise arranged images for 5 diseases. | 132247373 |
| Chest Xray Masks and Labels | Pulmonary Chest X-Ray Defect Detection | 5141014569 |
| Chicago Dept of Public Health Clinic Locations | From City of Chicago Open Data | 200211 |
| Chicago Flu Shot Clinic Locations - 2012 | From City of Chicago Open Data | 44447 |
| Chicago Primary Care Community Health Centers | From City of Chicago Open Data | 175775 |
| Chicago Public Health Department Events | From City of Chicago Open Data | 51478 |
| Chicago Public Health Statistics | From City of Chicago Open Data | 2357434 |
| Chicago Restricted Flavored Tobacco Products | From City of Chicago Open Data | 15093 |
| Chicago West Nile Virus Mosquito Test Results | From City of Chicago Open Data | 585505 |
| Childhood Blood Lead Surveillance | National and state-level surveillance data, 1997 to 2015 | 137994 |
| Chronic Disease Indicators | Disease Data Across the US, 2001-2016 | 9229731 |
| Chronic illness: symptoms, treatments and triggers | How do treatments and environmental stressors impact symptoms? | 20209227 |
| Clinical, Anthropometric & Bio-Chemical Survey | 1.89m records & 53 variables of unit level annual health survey data from India. | 51369141 |
| CMS Innovation Center Data | Explore open data from the CMS | 895189 |
| CMS Open Payments Dataset 2013 | Creating Public Transparency into Industry-Physician Financial Relationship | 306290992 |
| comparis.ch challenge | Swiss healthcare premium prediction | 3458647 |
| Confused student EEG brainwave data | EEG data from 10 students watching MOOC videos | 114236807 |
| Contraceptive product reviews from webmd.com | 2513976 | |
| Crop Nutrient Database | USDA data about crop nutrients in the U.S. | 55224 |
| Crowdedness at the Campus Gym | Number of attendees every 10 minutes from the last year in the gym | 611662 |
| CT Medical Images | CT images from cancer imaging archive with contrast and patient age | 261215866 |
| CT Scans: Before and After | A dataset for evaluating registration algorithms on medical images | 116309588 |
| Daily total female births in California, 1959 | Female Births CA 1959 | 1446 |
| DDSM Mammography | tfrecords files of scans from the DDSM dataset | 2874598043 |
| Dengue Cases in the Philippines | Monthly and Regional Cases of Dengue per 100,000 Population from 2008 to 2016 | 14603 |
| Dengue, Temperatura e Chuvas em Campinas-SP | Dados Mensais coletados entre 1998 e 2014 | 2556 |
| Deodorant Liking Dataset | For Product liking Prediction | 216730 |
| Descriptores Nucleos de Celulas | Descriptores obtenidos a partir de los datos del 2018 Data Science Bowl | 57444 |
| Determine Insulin intake for a diabetic | Try to advice insuline intake for a single patient | 20779 |
| Diabetes 130 US hospitals for years 1999-2008 | Diabetes - readmission | 4623973 |
| Diabetic_Dataset | Predicting Model of Readmitted Patients | 3314579 |
| Diagnose Specific Language Impairment in Children | Explore and create models using data derived from transcripts in CHILDES | 239917 |
| Discretized Datasets by Mangrove | Automatic Machine learning on diverse datasets | 49547267 |
| DNA combines History, Admixture, and Genealogy | My Personal DNA Journey | 1418798 |
| Doctor and lawyer profiles on Avvo.com | 20,000 doctor and lawyer profiles | 1603851 |
| Dr. Semmelweis Handwashing Survey Data | Source: Datacamp.com | 944 |
| DUI Arrests, alcohol/vehicle deaths, USA, 2015 | 1071 | |
| Early Biomarkers of Parkinson's Disease | Early biomarkers of Parkinson's disease based on natural connected speech | 66121 |
| Ebola Cases, 2014 to 2016 | Ebola data in record format with indicator, country, date and value | 82757 |
| ECG Heartbeat Categorization Dataset | Segmented and Preprocessed ECG Signals for Heartbeat Classification | 103633608 |
| echocardiogram-UCI | health issues and survival rate | 2213 |
| EEG Brainwave Dataset: Feeling Emotions | Positive and Negative emotional experiences captured from the brain | 12484187 |
| EEG brainwave dataset: mental state | Relaxed, Neutral, and Concentrating brainwave data | 52406657 |
| EEG data from basic sensory task in Schizophrenia | Button press and auditory tone event related potentials from 81 human subjects | 8480657321 |
| EEG data from sensory task in Schizophrenia (2) | Additional event related potential data from human subjects | 8560504925 |
| EEG_Clean | 427931 | |
| Emoji Diet Nutritional Data | Nutritional data per every gram of an emoji food dataset | 5570 |
| England Obesity Stats 2017 | This stat covers gender, regions, age groups, years 2005/6 till 2015/16 | 230559 |
| Example brain mapping dataset | Which part of the brain is involved in moving your lips? | 144671816 |
| Extension to Multidimensional Poverty Index | Indexing different types of simultaneous deprivation | 19427 |
| Eye disease dataset | Predicting Human eye diseases | 2547104 |
| Factors Affecting Early Adult Lung Function | Factors Affecting Early Adult Lung Function | 250632 |
| Factors Affecting Early Adult Lung Function 2 | 41658 | |
| Fall Detection Data from China | Activity of elderly patients along with their medical information | 270248 |
| FCN Trained on DDSM Images | 370959506 | |
| FDA Enforcement Actions | Food, drug, and medical device enforcements | 195971476 |
| Feline reticulocytes | Microscopy images of different cell types | 51357729 |
| Fertility Data Set | Predict if Normal or altered | 1043 |
| Finding and Measuring Lungs in CT Data | A collection of CT images, manually segmented lungs and measurements in 2/3D | 554684736 |
| Fitness Trends Dataset | A dataset of fitness trends and how they change with exercise | 1113 |
| Floodlight MS Dataset | Understanding Daily MS changes through smartphone data | 1222206 |
| Food choices | College students' food and cooking preferences | 5505473 |
| Foodborne Disease Outbreaks, 1998-2015 | What contaminant has caused the most hospitalizations and fatalities? | 234033 |
| Future medical event | 5998849 | |
| Gender Health Tunisia | Health Data by Gender in Tunisia | 2302 |
| Gene expression dataset (Golub et al.) | Molecular Classification of Cancer by Gene Expression Monitoring | 1504052 |
| General Practice Prescribing Data | One year of British National Health Service Prescription data | 1769157832 |
| Genetic medication selection awareness | Machine learning diagnostics for mental health | 22467 |
| Genetic Variant Classifications | Predict whether a variant will have conflicting clinical classifications. | 3597726 |
| Genome Information For Sequenced Organisms | taxonomy, statistics and dna sequence links for species with sequenced genomes | 1665565 |
| Ghana Health Facilities | Ownership & Types of Health Facilities in Ghana | 85790 |
| Ghana Health Facilities | Ownership location and types of health facilities in Ghana | 72985 |
| Global suicide data | 136651 | |
| HCC dataset | Hepatocellular Carcinoma Dataset | 2047887 |
| HCC survival data set | " Hepatocellular Carcinoma dataset (HCC dataset) " | 7867 |
| HCPCS Level II | Healthcare Common Procedure Coding System (HCPCS) Level II (BigQuery Dataset) | 3200160 |
| HDI Brazil (IDH Brasil) | Human Development Indexes and Census data for Brazilian municipalities | 8463048 |
| Health Analytics | 26 health indicators (642 variables) from 9 states and 284 districts of India | 913478 |
| Health and Nutrition Data from One Individual | Started January 1, 2019 | 22678 |
| Health Care Access/Coverage for 1995-2010 | Prevalence and Trends of Health Care Acess | 19145 |
| Health Care Searches By Metro Area in the US | Metropolitan area data on search interest in healthcare | 3908 |
| Health Insurance Coverage | Coverage rates before and after the Affordable Care Act | 2938 |
| Health insurance data | Classified by type of disease and type of insured. | 10090 |
| Health Insurance Marketplace | Explore health and dental plans data in the US Health Insurance Marketplace | 816682245 |
| Health Nutrition and Population Statistics | State of human health across the world | 14842664 |
| Health searches by US Metropolitan Area, 2005-2017 | Data from Google trends showing who searches for what and where | 33212 |
| HEALTHCARE PROVIDER FRAUD DETECTION ANALYSIS | HEALTHCARE PROVIDER FRAUD DETECTION ANALYSIS | 25582545 |
| Heart Attack Prediction | This file describes the contents of the heart-disease directory. | 2634 |
| Heart Disease and Stroke Prevention | Provides a comprehensive image for cardiovascular diseases & related prevention | 1181198 |
| Heart Disease Dataset | Public Health Dataset | 5728 |
| Heart Disease UCI | https://archive.ics.uci.edu/ml/datasets/Heart+Disease | 3438 |
| Heart rate throughout a day | Heart rate throughout a day measured through fitbit smartwatch | 49851 |
| Heartbeat Sounds | Classifying heartbeat anomalies from stethoscope audio | 116082239 |
| Help Blind Community to walk | Predict the action of walk after recognizing view in the image | 104104189 |
| Help me predict my pain | Find patterns in my pain diary | 9714 |
| Hepatitis B Virus Levels of Patients (Re-upload) | 2915 | |
| High-Content Screening with C.Elegans | A small, fully annotated dataset for getting starting with HCS analysis | 69336657 |
| Hindi Health Dataset | HHD Corpus worthy for NLP tasks | 264509 |
| Home Medical Visits - Healthcare | Collection of the visits of a Home Medical Services Company during 2 years | 1520903 |
| Hospital Charges for Inpatients | How inpatient hospital charges can differ among different providers in the US | 8146863 |
| Hospital General Information | General information & quality ratings for almost all US hospitals | 385027 |
| Hospital Payment and Value of Care | Government data about payment and quality of care | 1273532 |
| Hospital ratings | The official dataset used on Medicare.gov for hospital quality comparison | 270740 |
| Hospital Triage and Patient History Data | To predict hospital admission at the time of ED triage | 100257023 |
| Human Development Index | Human Development Index dataset with all sub indices | 365782 |
| Human genome annotation in CSV | genome annotation denormalized and stored in CSV format | 33909284 |
| human-capital-index | mirror from world bank | 1147463 |
| ICD10 Data | ICD-10 Scraped Web Pages using https://github.com/shams-sam/ICD10Data.com | 349495895 |
| Images of Canine Coccidiosis Parasite | Microscopy images of Isospora canis oocysts | 18969419 |
| Immunization Data in India | Immunization Data for the year of 2017-1018 from India | 126282 |
| INDIA and it^s numbers | Explore india and it^s people with their data | 2225 |
| India Census 2011 | Demographic Census Data for India | 623692 |
| India census yearly data | India census data along with elementary school, crime , health & gdp data | 13902117 |
| India Primary health care data | This is the data source of India primary health care in PHC, CHC and Subcentre. | 22621 |
| Indian Liver Patient Dataset | Data about Liver Patients in India | 7733 |
| Indian Liver Patient Records | Patient records collected from North East of Andhra Pradesh, India | 7760 |
| Industrial Safety and Health Analytics Database | Industrial labor accident data | 163870 |
| Infant Mortality, Fertility, Income | Historical data on infant mortality, fertility and income. | 321040 |
| Integrated Heart Disease Dataset | Contains all 4 databases from Heart Disease database at UCI repository. | 35229 |
| Jamaican Epidemiology Statistics | Islandwide Fever and Respiratory Symptoms | 5318 |
| journal | Plain text extracted from journal articles | 65517 |
| LA County Restaurant Inspections and Violations | Environmental health inspections and violations in LA County restaurants | 19028598 |
| LA Restaurant & Market Health Data | From Los Angeles Open Data | 12098527 |
| la-county-restaurant-inspections-and-violations | Environmental health inspection and voilation in la country restaurants | 19028598 |
| Life Expectancy (WHO) | Statistical Analysis on factors influencing Life Expectancy | 120343 |
| Life Expectancy of the World | Country wise Overall/Male/Female Life Expectancy Data | 3624 |
| Localization Data for Posture Reconstruction | Recordings of five people while wearing localization tags | 6596910 |
| Logistic Regression - Heart Disease Prediction | Prediction of Coronary Heart Disease | 56782 |
| Logistic regression To predict heart disease | heart disease prediction | 56754 |
| LogP of Chemical Structures | Predict properties based on chemical structure | 81428 |
| Lower Back Pain Symptoms Dataset | Collection of physical spine data | 20262 |
| Lung Masks for Shenzhen Hospital Chest X-ray Set | Manually Segmented Lungs Masks for Shenzhen Hospital Chest X-ray Set | 10243652 |
| Lung Nodule Malignancy | From suspicious nodules to diagnosis | 98188509 |
| Malaria Cell Images Dataset | Cell Images for Detecting Malaria | 353452851 |
| Malarial Mosquito Database | Geo-coded Inventory of Anophelines in the Sub-Saharan Africa: 1898-2016 | 1001255 |
| Mass Shootings in U.S | 1966-2017 with over 30 variables | 173398 |
| Measurements of urine pH | 35815 | |
| Medical Appointment No Shows | Why do 30% of patients miss their scheduled appointments? | 2585351 |
| Medical Cost Personal Datasets | Insurance Forecast by using Linear Regression | 16385 |
| Medical Examiner Case Archive | Data Provided by Cook County, IL Medical Examiner | 1754838 |
| Medical Speech, Transcription, and Intent | Audio utterances paired with text for common medical symptoms | 2817552937 |
| Medicare Data | Medicare Data (BigQuery Dataset) | 13816627778 |
| Medicare Provider Data 2016 Part D Prescriber | 1100278321 | |
| medicare_provider_inpatient | Data on Medicare Payments | 9902224 |
| Medicare's Doctor Comparison Scores | The 2017 Physican Compare Database | 254068537 |
| Meditation-EEG-Data | two data files of EEG recordings, one meditation and one baseline | 490883 |
| MEDLINE and MeSH | Biomedical bibliometric data and paper classification | 3772670852 |
| Mental Health in Tech Survey | Survey on Mental Health in the Tech Workplace in 2014 | 47244 |
| Mental health in technology survey: Raw data 2014 | Survey data downloaded directly from OSMI with no pre-processing | 339026 |
| MESSIDOR-2 DR Grades | Adjudicated DR Severity, DME, and Gradability for the MESSIDOR-2 fundus dataset | 7451 |
| MIAS Mammography | Looking for breast cancer | 215170043 |
| Microsoft Data Science Capstone | Predict heart disease rate (Regression) | 503642 |
| Mortality Projection by Worldwide Health Org. | Projections of mortality and causes of death, 2015 and 2030 | 4490525 |
| MRI and Alzheimers | Magnetic Resonance Imaging Comparisons of Demented and Nondemented Adults | 12845 |
| Multidimensional Poverty Measures | Indexing different types of simultaneous deprivation | 19633 |
| Multidimensional Poverty Measures | Harmonized Dataset for Comparisons Over Time | 50099 |
| My Sleep Log | My sleep log for 3+ monthes | 1957 |
| National Health and Nutrition Examination Survey | NHANES datasets from 2013-2014 | 7064993 |
| New York City Air Quality | From New York City Open Data | 86289 |
| New York City Farmers Markets | From New York City Open Data | 207764 |
| New York City HHC Facilities | From New York City Open Data | 15644 |
| New York City REACH Members | From New York City Open Data | 262759 |
| New York Fire Department | From New York City Open Data | 195190 |
| Non-invasive Blood Pressure Estimation | Vital signals and reference blood pressure values acquired from 26 subjects. | 38043388 |
| NPPES Plan and Provider Enumeration System | The CMS National Plan and Provider Enumeration System Data (BigQuery Dataset) | 17485078932 |
| NSDUH 2016 | National Survey on Drug Use and Health Dataset (2016) | 32623538 |
| NTR Vaidya Seva 2017 | Healthcare data from the Indian state of Andhra Pradesh (anonymized) | 24696762 |
| nucleus | Microscope image cell nuclei cluster dataset | 5239115 |
| Nursing Home Compare | Comparing the quality of care of over 15,000 nursing homes in the U.S. | 32571807 |
| Nutrition Facts for McDonald's Menu | Calories, fat, and sugar for every cheeseburger, fries, and milkshake on menu | 7540 |
| NY Child health plus income levels | From New York City Open Data | 62001 |
| NY Community Health Centers & Survey | From New York City Open Data | 322795 |
| NY Current Reservoir Levels | From New York City Open Data | 120240 |
| NY Department of Health and Mental Hygiene | From New York City Open Data | 12636365 |
| NY Rodent Inspection | From New York City Open Data | 119104789 |
| NYC Health and Hospitals Corp Patient Satisfaction | From New York City Open Data | 3881 |
| NYC Locations Providing Seasonal Flu Vaccinations | From New York City Open Data | 268185 |
| NYC Most Popular Baby Names | From New York City Open Data | 152502 |
| NYS Bridges To Health Service Agencies | From New York State Open Data | 152571 |
| NYS Mental Health Information | From New York State Open Data | 968759 |
| NYS Patient Characteristics Survey (PCS): 2015 | From New York State Open Data | 3775355 |
| NYS PSYCKES Antipsychotic Polypharmacy Quality | From New York State Open Data | 4925344 |
| NYS Substance Use Disorder Data | From New York State Open Data | 796639 |
| OAK Chronic Disease Preventable Hospitalizations | Explore open data from the city of Oakland | 4774 |
| Oakland Emergency Department Visits | Explore open data from the city of Oakland | 5900 |
| Oakland Equity Indicators - Health & Wellness | Explore open data from the city of Oakland | 19287 |
| Oakland Equity Indicators - Housing & Rent | Explore open data from the city of Oakland | 30287 |
| Oakland Equity Indicators - Policing & Crime | Explore open data from the city of Oakland | 16439 |
| Oakland Parks and Recreation Facilities & Ratings | Explore open data from the city of Oakland | 14195 |
| Oakland Rate of Acute Preventable Hospitalizations | Explore open data from the city of Oakland | 4786 |
| Open Units | Number of units of alcohol in branded drinks in a variety of standard servings | 15866 |
| OpenPowerlifting | Contributions to the OpenPowerlifting Project | 9248065 |
| Oregon Hospital Data | A dataset highlighting procedure costs across various hospitals in Oregon, USA | 9270 |
| Orlando Crimes | 6186431 | |
| Orlando Police Shootings | Officer-Involved Shootings in the City of Orlando, Florida | 6278 |
| OSMI Mental Health in Tech Survey 2016 | Data on prevalence and attitudes towards mental health among tech workers | 14345902 |
| OSMI Mental Health in Tech Survey 2017 | Data on prevalence and attitudes towards mental health among tech workers | 219149 |
| Parkinson's Drawings | Distinguishing Different Stages of Parkinson s Disease | 21801184 |
| Periodic Table of Elements Mapped to Stocks | Elements & Minerals with known and hidden relationships to Stocks | 1024714 |
| Pima Indians | A study of diabetes through health information | 5312 |
| Pima Indians Diabetes Database | Predict the onset of diabetes based on diagnostic measures | 9077 |
| pima-indians-diabetes.data | pima-indians-diabetes-dataset | 8971 |
| Pixels's intensity of positive and negative nuclei | dataset to detect 20x20 pixel nuclei in histological images | 552721 |
| Plantar Fasciitis | data-set on clinical data of patients with plantar fasciitis | 3665 |
| Pneumonia Chest X ray | 1222457547 | |
| Pollution in Atchison Village, Richmond CA | Pollution and Wind Data from August to November 2015 | 259790 |
| PPG Heart Beat for Cognitive Fatigue Prediction | Raw PPG waves & annotations from subjects playing computer games for 22 hours | 175638541 |
| Predict Mortality/Death Rate. | 770k records & 121 variables of unit level survey data collected from 9 States. | 58544198 |
| Predict Outcome of Pregnancy | Unit level survey data collected from 9 states. | 593020193 |
| Prescription-based prediction | Predicting doctor attributes from prescription behavior | 33294494 |
| Project Tycho: Contagious Diseases | Weekly case reports for polio, smallpox, and other diseases in the United States | 3038548 |
| Protein Secondary Structure | Curated dataset for protein secondary structure prediction | 40687626 |
| ProteinSubcellularLocalization | ProteinClassification | 1009281 |
| Published Articles in Nature Research_2019 | First Quarter of 2019 | 435173 |
| Raman spectroscopy of Diabetes | Raman Spectroscopy to Screen Diabetes Mellitus with Machine Learning Tools | 1360278 |
| Random Sample of NIH Chest X-ray Dataset | 5,606 images and labels sampled from the NIH Chest X-ray Dataset | 2253039803 |
| Respiratory Sound Database | Use audio recordings to detect respiratory diseases. | 1979569023 |
| RMU Dissertation - Final Data File | Healthcare Data Analytics | 1502010 |
| RSNA Bone Age | Predict Age from X-Rays | 9972015873 |
| Running Heart Rate Recovery | Heart rate stop and start event data | 167558 |
| RxNorm Data | National Library of Medicine RxNorm Data (BigQuery Dataset) | 26378961799 |
| salesDB | DB of grocery sales 6M+ | 485754004 |
| salesDB grocery market | sales grocery market | 485754004 |
| School Shootings US 1990-present | Record of all school shooting incidents since 1990 | 41699 |
| SciSpacy Pre-trained Models | Models related to using spaCy for scientific documents | 516926453 |
| Seattle Youth and Family Initiative | From City of Seattle Open Data | 48757 |
| Segmenting Soft Tissue Sarcomas | A challenge to automate tumor segmentation | 320765902 |
| Seizure Data | Patterns and Predictions | 10340 |
| Self-tracking | Collection of selected self-tracking measures of the author | 8990 |
| Severely Injured Workers | ~22k Injury Reports for US Workers, 2015-2017 | 3659164 |
| SF Department of Public Health Flu Shot Locations | From San Francisco Open Data | 4170 |
| Sign Language MNIST | Drop-In Replacement for MNIST for Hand Gesture Recognition Tasks | 32286533 |
| siim-acr-pneumothorax-segmentation.zip dataset | Complete Dataset (DICOM + PNG + Label) | 7905041642 |
| Single Cell RNA Seq from Stoeckius et al. 2017 | https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE100866 | 1553343 |
| Sleep Data | Personal Sleep Data from Sleep Cycle iOS App | 17114 |
| Sleep Patterns of Railroad Dispatchers | How well Railroad Dispatchers Sleep | 21766 |
| Structural MRI Datasets (T1, T2, FLAIR etc.) | Coursera NeuroHacking in R course datasets | 198797820 |
| Structural Protein Sequences | Sequence and meta data for various protein structures | 28782695 |
| Student Alcohol Consumption | Social, gender and study data from secondary school students | 17972 |
| Suicides in India | Sucides in each state is classified according to various parameters from 2001-12 | 1381876 |
| Surgical complications (Canterbury, NZ, 2014-2018) | Obtained using the Official Information Act | 6535 |
| Sutton Health Charges (2019) | Price list of services provided by Sutton Health | 2173709 |
| SWELL dataset | Heart Rate Variability (HRV) dataset for research on stress and user modeling | 121584248 |
| Symptoms Corpus | A corpus which details specific symptoms suffered by humans. | 28939 |
| Synchronized Brainwave Dataset | Brainwave recordings from a group presented with a shared audio-visual stimulus | 25404627 |
| Synthetic Cell Images and Masks - BBBC005_v1 | Build models to segment and count cells. | 1893724776 |
| Tanzanian Health Facility Registry Dataset | Tabular form of official Tanzanian Health Facility Registry data | 4183218 |
| The fight against malaria | Who is dying and being saved from this destructive disease? | 429826 |
| The Human Microbiome Project | 79629 | |
| Tick Heamaphysalis punctata | Heamaphysalis punctata larva | 4286330 |
| Tobacco Use 1995-2010 | Prevalence and Trends: Four Level Smoking Data | 13406 |
| Tobacco Use and Mortality, 2004-2015 | Hospital admissions, prescriptions, and fatalities in England | 29277 |
| Trypophobia | 6k of handchecked, normalized trypophobia triggering images | 1817491598 |
| tweets | tweets related to dental care affordability and dental + opioid use | 24069092 |
| U.S. Chronic Disease Indicators (CDI) | Chronic Disease Dataset | 10163804 |
| U.S. Healthcare Data | Population Health, Diseases, Drugs, Nutritions, Health-plans | 37546602 |
| U.S. Opiate Prescriptions/Overdoses | Can you save lives through predictive modeling? | 1900762 |
| Ulaanbaatar Particulate Matter Pollution 2015-2018 | PM2.5 & PM10 measurements from 18 stations in Ulaanbaatar, Mongolia | 8127958 |
| Unemployment and mental illness survey | Exploring the causation of high unemployment among the mentally ill | 108499 |
| United States Births by day 1994-2003 | US Births CDC 1994 - 2014 | 188394 |
| US Veteran Suicides | 2005-2011 veteran deaths outside of combat by state | 28713 |
| USA Hospitals | Hospitals across the USA | 739211 |
| USP Drug Classification | Medical drug code classes & metadata | 26574 |
| USPTO Cancer Moonshot Patent Data | Discover patent trends and innovations in cancer research (BigQuery) | 100717743 |
| VIA/I-ELCAP Lung CT Database | 30 low-dose documented whole-lung CT scans | 3740235634 |
| WISDM_ar_v1.1_raw | 11196617 | |
| Women Health Care | Multi label dataset with 14 targets | 7035324 |
| word2vec reddit medication model | 42575356 | |
| World Bank: Education Data | World Bank: Education Data (BigQuery Dataset) | 628675690 |
| World Bank: GHNP Data | World Bank: Global Health, Nutrition, and Population Data (BigQuery Dataset) | 260638540 |
| World Bank: International Debt Data | World Bank: International Debt Data (BigQuery Dataset) | 139420838 |
| World Developmet indicators(2008-2015) | 187489 | |
| Zika Virus Epidemic | Analyze the ongoing spread of this infectious disease | 832097 |