Eric J. Daza, DrPH, MPS

Menlo Park, California, United States
7K followers 500+ connections

View mutual connections with Eric J.

Welcome back

Email or phone

Password

Forgot password?

or

New to LinkedIn? Join now

or

New to LinkedIn? Join now

Join to follow

Keep AI

Stanford University School of Medicine

About

I help you use your own data to learn about yourself. How? Thorough statistical and…

Contributions

What are the statistical analysis tools that can be used to analyze survey data?

Intensive longitudinal data approaches can be appropriate for survey outcomes collected very frequently. These include time-varying coefficient models and multilevel time series models. See: https://statsof1.org/resources/#aggregated-n-of-1-analyses —in particular, Walls and Shafer (2006).

Eric J. Daza, DrPH, MPS contributed 2 months ago Upvote
What are the most effective ways to transition to a career in statistics?

'While “being a data scientist” requires knowing core statistical principles and having basic statistical skills, “being a statistician” in data science can be a liability!' More from my 20+ years as a biostatistian-turned-data-scientist here: https://towardsdatascience.com/why-you-should-think-of-the-enterprise-of-data-science-more-like-a-business-less-like-science-45227c65c09

Eric J. Daza, DrPH, MPS contributed 4 months ago Upvote
What are the most effective ways to transition to a career in statistics?

For folks with doctoral training in particular, it might help to know that I rely on and apply these two core technical concepts all the time: 1. (straight-up statistics) the Law of Total Expectation E(Y) = E{E(Y|X)} 2. (statistics and machine learning connections) the implicit prediction-expectation approximate equivalence E(Y|X) ≈ Y ~ f(X) (i.e., conditional expectation ≈ prediction function)

Eric J. Daza, DrPH, MPS contributed 4 months ago Upvote
What distinguishes the job responsibilities of a statistician from those of a data analyst?

"Academic statisticians and clinical-trials biostatisticians rarely (if ever) work as co-investigators; we’re not trained to do so. My academic training tells me that making decisions as a co-investigator is overstepping, maybe even disrespectful (i.e., by sending the message that I don’t trust the scientist, even though it’s clearly not my field of expertise). It’s not a typical responsibility for professional statisticians." https://towardsdatascience.com/why-you-should-think-of-the-enterprise-of-data-science-more-like-a-business-less-like-science-45227c65c09

Eric J. Daza, DrPH, MPS contributed 5 months ago Upvote

Activity

Recent Stats+Stories #Podcast - American Statistical Association - ASA Noel Cressie talks about the WOllongong Methodology for Bayesian Assimilation…

Recent Stats+Stories #Podcast - American Statistical Association - ASA Noel Cressie talks about the WOllongong Methodology for Bayesian Assimilation…

Liked by Eric J. Daza, DrPH, MPS
Evidation Shares Real-World Insights on Weight Management & Healthcare Access at ISPOR 2024 This week, our research team is sharing findings from…

Evidation Shares Real-World Insights on Weight Management & Healthcare Access at ISPOR 2024 This week, our research team is sharing findings from…

Liked by Eric J. Daza, DrPH, MPS
One of the most overlooked behaviors that promotes optimal health is adequate sleep. See NIH study here: https://lnkd.in/e9-p5kYe CDC has just…

One of the most overlooked behaviors that promotes optimal health is adequate sleep. See NIH study here: https://lnkd.in/e9-p5kYe CDC has just…

Liked by Eric J. Daza, DrPH, MPS

Join now to see all activity

Experience

Keep AI
-

San Francisco Bay Area
-

United States
- -
  
  United States
- -
  
  United States
- -
  
  San Francisco Bay Area
- -
  
  San Francisco Bay Area
- -
  
  San Francisco Bay Area
- -
  
  Stanford, CA
- -
  
  Stanford, CA
- -
  
  Stanford, CA
- -
  
  Chapel Hill, NC
- -
-
-
-

Ithaca, New York, United States
-

Education

-

2015 - 2018

(Mentor: Michael Baiocchi.) Trained in data-analytic and machine-learning methods, and methodological research in causal inference. Collaborated within academic and non-academic projects by providing statistical consultation and programming solutions. Developed personalized digital health statistical methods.
-

2007 - 2015

Activities and Societies: rock climbing, Krav Maga, bass guitar, piano, American Statistical Association, International Biometric Society: Eastern North America Region

(Advisor: Michael G. Hudgens. Co-Advisor: Amy H. Herring.) Dissertation: "Longitudinal Regression Conditioning on Continuation"
-

2001 - 2002

Activities and Societies: Risley Theatre, Gateway Theatre, Louie's Lunch
-

1996 - 2000

Activities and Societies: Risley Theatre, Gateway Theatre, Louie's Lunch
-

1992 - 1996

Activities and Societies: Windowpanes, Kairos, The X-Files Club, Chemistry Club, The Knights of Fantasy & Role-Playing
-

1983 - 1987
-

2005 - 2007

biostatistics pre-requisite courses

Volunteer Experience

Chairperson

Chairperson

American Statistical Association - ASA

Jan 2021

Civil Rights and Social Action

Justice Equity Diversity Inclusion (JEDI) Outreach Group (OG) https://datascijedi.org/
— Chair-Elect (current: Jan 2024 — present)
— Professional Development Committee (PDC) Chair (2 years: Jan 2022 — December 2023)
— PDC Member (1 year: Jan 2021 — December 2022)
Volunteer

Volunteer

Braver Angels

Nov 2022

Civil Rights and Social Action

I help with statistical analysis and data science.

Licenses & Certifications

Interviewing Techniques

LinkedIn

Issued Apr 2023

See credential
Inclusive Mindset

LinkedIn

Issued Mar 2023

See credential
Unconscious Bias

LinkedIn

Issued Apr 2021

See credential

Publications

Methods and Systems for Generating Personalized Treatments via Causal Inference

United States Patent and Trademark Office February 8, 2024

Provided herein are systems, methods, computer-readable media, and techniques for generating a personalized recommended intervention for a subject based on causal inference, including: obtaining a first set of time series data and a second set of time series data, the first set of time series data relating to a first variable indicative of a health behavior of the subject, and the second set of time series data relating to a second variable indicative of a health condition of the subject;…

Provided herein are systems, methods, computer-readable media, and techniques for generating a personalized recommended intervention for a subject based on causal inference, including: obtaining a first set of time series data and a second set of time series data, the first set of time series data relating to a first variable indicative of a health behavior of the subject, and the second set of time series data relating to a second variable indicative of a health condition of the subject; determining a causal effect of the first variable on the second variable by estimating an average treatment effect, wherein the average treatment effect is estimated by processing the first set of time series data and the second set of time series data using a model-twin randomization method; and generating a personalized treatment or intervention recommendation for the subject to change the health condition based on the causal effect.

See publication
Patient and Physician Perspectives on the Use of a Connected Ecosystem for Diabetes Management: International Cross-Sectional Observational Study

JMIR Form Res November 30, 2023
Collaboration between people with type 2 diabetes (T2DM) and their health care teams is important for optimal control of the disease and outcomes. Digital technologies could potentially tie together several health care-related devices and platforms into connected ecosystems (CES), but attitudes about CES are unknown.

We surveyed convenience samples of patients and physicians to better understand which patient characteristics are associated with higher likelihoods of (1) participating in…

Collaboration between people with type 2 diabetes (T2DM) and their health care teams is important for optimal control of the disease and outcomes. Digital technologies could potentially tie together several health care-related devices and platforms into connected ecosystems (CES), but attitudes about CES are unknown.

We surveyed convenience samples of patients and physicians to better understand which patient characteristics are associated with higher likelihoods of (1) participating in a potential CES program, as self-reported by patients with T2DM and (2) clinical benefit from participation in a potential CES program, as reported by physicians.

Other authors
See publication
What possibly affects nighttime heart rate? Conclusions from N-of-1 observational data

Digital Health August 24, 2022
Results

We found that physical activity can increase the nighttime heart rate amplitude, whereas there were no strong conclusions about its suggested effect on total sleep time. Self-reported states such as exercise, yoga, and stress were associated with increased (for the first two) and decreased (last one) average nighttime heart rate.

Conclusions

This study implemented the MoTR method evaluating the suggested effects of daily stressors on nighttime heart rate, sleep…

Results

We found that physical activity can increase the nighttime heart rate amplitude, whereas there were no strong conclusions about its suggested effect on total sleep time. Self-reported states such as exercise, yoga, and stress were associated with increased (for the first two) and decreased (last one) average nighttime heart rate.

Conclusions

This study implemented the MoTR method evaluating the suggested effects of daily stressors on nighttime heart rate, sleep time, and physical activity in an individualized way: via the N-of-1 approach. A Python implementation of MoTR is freely available.

Other authors
See publication
Model-Twin Randomization (MoTR): A Monte Carlo Method for Estimating the Within-Individual Average Treatment Effect Using Wearable Sensors

arXiv August 1, 2022
Temporally dense single-person "small data" have become widely available thanks to mobile apps and wearable sensors. Many caregivers and self-trackers want to use these data to help a specific person change their behavior to achieve desired health outcomes. Ideally, this involves discerning possible causes from correlations using that person's own observational time series data. In this paper, we estimate within-individual average treatment effects of physical activity on sleep duration, and…

Temporally dense single-person "small data" have become widely available thanks to mobile apps and wearable sensors. Many caregivers and self-trackers want to use these data to help a specific person change their behavior to achieve desired health outcomes. Ideally, this involves discerning possible causes from correlations using that person's own observational time series data. In this paper, we estimate within-individual average treatment effects of physical activity on sleep duration, and vice-versa. We introduce the model twin randomization (MoTR; "motor") method for analyzing an individual's intensive longitudinal data. Formally, MoTR is an application of the g-formula (i.e., standardization, back-door adjustment) under serial interference. It estimates stable recurring effects, as is done in n-of-1 trials and single case experimental designs. We compare our approach to standard methods (with possible confounding) to show how to use causal inference to make better personalized recommendations for health behavior change, and analyze 222 days of Fitbit sleep and steps data for one of the authors.

Other authors
See publication
Estimating the Burden of Influenza-like Illness on Daily Activity at the Population Scale Using Commercial Wearable Sensors

JAMA Network Open May 12, 2022
Question How can the true burden of influenza-like illnesses (ILIs) be estimated given that most cases of ILIs are mild and go undocumented ?

Findings This cohort study of 15 122 adults who reported ILI symptoms and had data from wearable sensors at symptom onset found an overall reduction in mobility equivalent to 15% of the active US population becoming completely immobilized for 1 day. More than 60% of this reduction occurred among persons who had sought no medical…

Question How can the true burden of influenza-like illnesses (ILIs) be estimated given that most cases of ILIs are mild and go undocumented ?

Findings This cohort study of 15 122 adults who reported ILI symptoms and had data from wearable sensors at symptom onset found an overall reduction in mobility equivalent to 15% of the active US population becoming completely immobilized for 1 day. More than 60% of this reduction occurred among persons who had sought no medical care.

Meaning This study suggests that the burden of ILIs is much greater than had previously been understood.

Other authors
See publication
Creating Evidence from Real World Patient Digital Data

Frontiers SA April 7, 2021
N-of-1 randomized controlled trials (RCTs) provide an opportunity to evaluate individual patient response to interventions, by randomly allocating different time periods within an individual to repeated intervention and control conditions and comparing responses. N-of-1 observational studies involve the repeated measurement of an outcome (e.g. pain) in a patient over time, but with no intervention implemented, in order to draw conclusions about naturally-occurring patterns and predictors of…

N-of-1 randomized controlled trials (RCTs) provide an opportunity to evaluate individual patient response to interventions, by randomly allocating different time periods within an individual to repeated intervention and control conditions and comparing responses. N-of-1 observational studies involve the repeated measurement of an outcome (e.g. pain) in a patient over time, but with no intervention implemented, in order to draw conclusions about naturally-occurring patterns and predictors of outcomes over time. Both N-of-1 RCTs and observational studies can have a ‘self-study’ design, where an individual conducts the study on themselves, to answer research questions they have generated themselves. N-of-1 RCTs and observational studies provide individualized findings that can be aggregated to produce results equivalent to those found in traditional group-based RCTs and population-level epidemiological studies, respectively, but requiring fewer patients for the same power.

N-of-1 RCTs and observational studies are well-suited to complement, strengthen, and generate advances in precision medicine, patient-centred healthcare, and personalised health. Since 2015, the number of N-of-1 articles has doubled annually.
Similarly, digital health is an exploding field, with over 1,000 studies registered on clinicaltrials.gov.
Digital health, and digital therapeutics in particular, complement N-of-1 RCTs and observational studies by providing relevant individualized health data from, for example, worn sensors, implants, regular lab assays, or -omics sequencing. Such data can be compared to population-health databases to target a patient’s strongest possible treatment option (as in cancer-risk studies) and, in turn, inform the design of an N-of-1 RCT to evaluate it.
Digital health data can also be continuously monitored during the study itself and used to help tailor a treatment to the needs and preferences of patients in real time.

Other authors
See publication
Inequity in California's Smokefree Workplace Laws: A Legal Epidemiologic Analysis of Loophole Closures

American Journal of Preventive Medicine January 15, 2020
Introduction
California's landmark 1994 Smokefree Workplace Act contained numerous exemptions, or loopholes, believed to contribute to inequities in smokefree air protections among low-income communities and communities of color (e.g., permitting smoking in warehouses, hotel common areas). Cities/counties were not prevented from adopting stronger laws. This study coded municipal laws and state law changes (in 2015–2016) for loophole closures and determined their effects in reducing…

Introduction
California's landmark 1994 Smokefree Workplace Act contained numerous exemptions, or loopholes, believed to contribute to inequities in smokefree air protections among low-income communities and communities of color (e.g., permitting smoking in warehouses, hotel common areas). Cities/counties were not prevented from adopting stronger laws. This study coded municipal laws and state law changes (in 2015–2016) for loophole closures and determined their effects in reducing inequities in smokefree workplace protections.

Methods
Public health attorneys reviewed current laws for 536 of California's 539 cities and counties from January 2017 to May 2018 and coded for 19 loophole closures identified from legislative actions (inter-rater reliability, 87%). The local policy data were linked with population demographics from intercensal estimates (2012–2016) and adult smoking prevalence (2014). The analyses were cross-sectional and conducted in February–June 2019.

Results
Between 1994 and 2018, jurisdictions closed 6.09 loopholes on average (SD=5.28). Urban jurisdictions closed more loopholes than rural jurisdictions (mean=6.40 vs 3.94, p<0.001), and loophole closure scores correlated positively with population size, median household income, and percentage white, non-Hispanic residents (p<0.001 for all). Population demographics and the loophole closure score explained 43% of the variance in jurisdictions’ adult smoking prevalence. State law changes in 2015–2016 increased loophole closure scores and decreased jurisdiction variation (mean=9.74, SD=3.56); closed more loopholes in rural versus urban jurisdictions (meangain=4.44 vs 3.72, p=0.002); and in less populated, less affluent jurisdictions, with greater racial/ethnic diversity, and higher smoking prevalence (p<0.001 for all).

Conclusions
Although jurisdictions made important progress in closing loopholes in smokefree air law, state law changes achieved greater reductions in inequities in policy coverage.

Other authors
See publication
Effects of Sleep Deprivation on Blood Glucose, Food Cravings, and Affect in a Non-Diabetic: An N-of-1 Randomized Pilot Study

Healthcare, Multidisciplinary Digital Publishing Institute December 25, 2019
Abstract: Sleep deprivation is a prevalent and rising health concern, one with known effects on blood glucose (BG) levels, mood, and calorie consumption. However, the mechanisms by which sleep deprivation affects calorie consumption (e.g., measured via self-reported types craved food) are unclear, and may be highly idiographic (i.e., individual specific). Single-case or “n-of-1” randomized trials (N1RT) are useful in exploring such effects by exposing each subject to both sleep deprivation and…

Abstract: Sleep deprivation is a prevalent and rising health concern, one with known effects on blood glucose (BG) levels, mood, and calorie consumption. However, the mechanisms by which sleep deprivation affects calorie consumption (e.g., measured via self-reported types craved food) are unclear, and may be highly idiographic (i.e., individual specific). Single-case or “n-of-1” randomized trials (N1RT) are useful in exploring such effects by exposing each subject to both sleep deprivation and baseline conditions, thereby characterizing effects specific to that individual. We had two objectives: (1) To test and generate individual-specific N1RT hypotheses of the effects of sleep deprivation on next-day BG level, mood, and food cravings in two non-diabetic individuals; (2) To refine and guide a future n-of-1 study design for testing and generating such idiographic hypotheses for personalized management of sleep behavior in particular, and for chronic health conditions more broadly. We initially did not find evidence for an idiographic effect of sleep deprivation, but better-refined post hoc findings indicate that sleep deprivation may have increased BG fluctuations, cravings, and negative emotions. We also introduce an application of mixed-effects models and pancit plots to assess idiographic effects over time.

Other authors
See publication
Person as Population: A Longitudinal View of Single-Subject Causal Inference for Analyzing Self-Tracked Health Data

arXiv January 22, 2019

Single-subject health data are becoming increasingly available thanks to advances in self-tracking technology (e.g., mobile devices, apps, sensors, implants). Many users and health caregivers seek to use such observational time series data to recommend changing health practices in order to achieve desired health outcomes. However, there are few available causal inference approaches that are flexible enough to analyze such idiographic data. We develop a recently introduced framework, and…

Single-subject health data are becoming increasingly available thanks to advances in self-tracking technology (e.g., mobile devices, apps, sensors, implants). Many users and health caregivers seek to use such observational time series data to recommend changing health practices in order to achieve desired health outcomes. However, there are few available causal inference approaches that are flexible enough to analyze such idiographic data. We develop a recently introduced framework, and implement a flexible random-forests g-formula approach to estimating a recurring individualized effect called the "average period treatment effect". In the process, we argue that our approach essentially resembles that of a longitudinal study by partitioning a single time series into periods taking on binary treatment levels. We analyze six years of the author's own self-tracked physical activity and weight data to demonstrate our approach, and compare the results of our analysis to one that does not properly account for confounding.

See publication
Causal analysis of self-tracked time series data using a counterfactual framework for n-of-1 trials.

Methods of Information in Medicine Feb 2018

Many of an individual's historically recorded personal measurements vary over time, thereby forming a time series (e.g., wearable-device data, self-tracked fitness or nutrition measurements, clinical/chronic conditions). Statistical analyses of such n-of-1 (i.e., single-subject) observational studies (N1OSs) can be used to discover possible cause-effect relationships to then self-test in an n-of-1 randomized trial (N1RT). However, a principled way of determining how and when to interpret an…

Many of an individual's historically recorded personal measurements vary over time, thereby forming a time series (e.g., wearable-device data, self-tracked fitness or nutrition measurements, clinical/chronic conditions). Statistical analyses of such n-of-1 (i.e., single-subject) observational studies (N1OSs) can be used to discover possible cause-effect relationships to then self-test in an n-of-1 randomized trial (N1RT). However, a principled way of determining how and when to interpret an N1OS association as a causal effect (e.g., as if randomization had occurred) is needed. Our goal in this paper is to help bridge the methodological gap between risk-factor discovery and N1RT testing by introducing a basic counterfactual framework for N1OS design and personalized causal analysis. We introduce and characterize what we call the average period treatment effect (APTE), i.e., the estimand of interest in an N1RT, and build an analytical framework around it that can accommodate autocorrelation and time trends in the outcome, effect carryover from previous treatment periods, and slow onset or decay of the effect. The APTE is loosely defined as a contrast (e.g., difference, ratio) of average outcomes the individual can experience under different treatment levels at a given treatment period. To illustrate the utility of our framework for APTE discovery and estimation, two common causal inference methods are specified within the N1OS context. We then apply the framework and methods to search for estimable and interpretable APTEs using six years of the author's self-tracked weight and exercise data, and report both the preliminary findings and the challenges we faced in conducting N1OS causal discovery. Causal analysis of an individual's time series data can be facilitated by an N1RT counterfactual framework. For inference to be valid, the veracity of certain key assumptions must be assessed critically, and the hypothesized causal models must be interpretable and meaningful.

See publication
Thyroid cancer mortality higher in Filipinos in United States: an analysis using national mortality records from 2003-2012.

Cancer Sep 2017
Background: Well-differentiated thyroid carcinoma has a favorable prognosis, but patients with multiple recurrences have drastically lower survival. Filipinos in the US are known to have high thyroid cancer incidence and recurrence rates. It is unknown whether Filipinos also have higher thyroid cancer mortality rates.
Methods: We studied thyroid cancer mortality in Filipino, non-Filipino Asian (NFA), and non-Hispanic White (NHW) adults using US death records (2003-2012) and US Census data…

Background: Well-differentiated thyroid carcinoma has a favorable prognosis, but patients with multiple recurrences have drastically lower survival. Filipinos in the US are known to have high thyroid cancer incidence and recurrence rates. It is unknown whether Filipinos also have higher thyroid cancer mortality rates.
Methods: We studied thyroid cancer mortality in Filipino, non-Filipino Asian (NFA), and non-Hispanic White (NHW) adults using US death records (2003-2012) and US Census data. Age-adjusted mortality rates (AMRs) and proportional mortality ratios (PMRs) were calculated. Gender, nativity status, age at death, and educational attainment were also examined.
Results: We examined 19,940,952 deaths. AMR due to thyroid cancer was highest in Filipinos (1.72 deaths per 100,000, 95% CI 1.51-1.95) compared to NFAs (1.03 per 100,000, 95% CI 0.95-1.12) and NHWs (1.17 per 100,000, 95% CI 1.16-1.18). Compared to NHWs, higher proportionate mortality was observed in Filipino women (3-5 times higher) across all age groups, and Filipino men had 2-3 times higher PMR in the subgroup over the age of 55. Filipinos that completed higher education had notably higher PMR (5.0) than their counterparts who had not (3.5).
Conclusions: Negative prognostic factors for thyroid cancer traditionally include “age greater than 45 years” and “male gender.” We demonstrate that Filipinos die of thyroid cancer at higher rates than NFAs and NHWs of similar ages. Highly-educated Filipinos and Filipino women may be especially at risk for poor thyroid cancer outcomes. Filipino ethnicity should be factored into clinical decision-making in the management of thyroid cancer.

Other authors
Estimating inverse-probability weights for longitudinal data with dropout or truncation: The xtrccipw command

The Stata Journal June 28, 2017
Abstract. Individuals may drop out of a longitudinal study, rendering their outcomes unobserved but still well defined. However, they may also undergo truncation (for example, death), beyond which their outcomes are no longer meaningful. Kurland and Heagerty (2005, Biostatistics 6: 241–258) developed a method to conduct regression conditioning on nontruncation, that is, regression conditioning on continuation (RCC), for longitudinal outcomes that are monotonically missing at random (for…

Abstract. Individuals may drop out of a longitudinal study, rendering their outcomes unobserved but still well defined. However, they may also undergo truncation (for example, death), beyond which their outcomes are no longer meaningful. Kurland and Heagerty (2005, Biostatistics 6: 241–258) developed a method to conduct regression conditioning on nontruncation, that is, regression conditioning on continuation (RCC), for longitudinal outcomes that are monotonically missing at random (for example, because of dropout). This method first estimates the probability of dropout among continuing individuals to construct inverse-probability weights (IPWs), then fits generalized estimating equations (GEE) with these IPWs. In this article, we present the xtrccipw command, which can both estimate the IPWs required by RCC and then use these IPWs in a GEE estimator by calling the glm command from within xtrccipw. In the absence of truncation, the xtrccipw command can also be used to run a weighted GEE analysis. We demonstrate the xtrccipw command by analyzing an example dataset and the original Kurland and Heagerty (2005) data. We also use xtrccipw to illustrate some empirical properties of RCC through a simulation study.

Other authors
See publication
"Uncertain times call for certain measurements."

Science: NextGen voices: Advocacy in brief April 7, 2017
Other authors
See publication
Multiple Cause-of-Death Analysis Reveals Under Reporting of Disease Burden for Diabetes

American Heart Association, Inc. March 7, 2017
Introduction: Despite being considerably under reported as the underlying cause of death on death certificates, and consequently on mortality figures, diabetes is among the ten leading causes of death in the U.S. A multiple cause-of-death analysis shows the extent to which diabetes is associated with other leading causes of death.

Hypothesis: Analysis of multiple-cause-of-death will confirm prevalence rates of diabetes among racial/ethnic minority populations, demonstrate the impact of…

Introduction: Despite being considerably under reported as the underlying cause of death on death certificates, and consequently on mortality figures, diabetes is among the ten leading causes of death in the U.S. A multiple cause-of-death analysis shows the extent to which diabetes is associated with other leading causes of death.

Hypothesis: Analysis of multiple-cause-of-death will confirm prevalence rates of diabetes among racial/ethnic minority populations, demonstrate the impact of diabetes in association with other causes of death, and highlight variations of burden of disease among different racial/ethnic groups.

Methods: Causes of death were identified using the Multiple Cause Mortality Files of the National Center for Health Statistics from 2003 to 2012. Age-adjusted mortality rates were calculated for diabetes both as the underlying cause of death (UCD) and as multiple causes of death (MCD) by racial/ethnic groups (NHWs, Blacks, Asians, and Hispanic/Latinos). Frequencies and proportions were calculated by race/ethnicity groups. Linear regression model was used for number of causes per death.

Results: A total of 2,335,198 decedents had diabetes listed as MCD in the U.S. national death records from 2003-2012. Mortality rates of diabetes as MCD were 3.4 times than UCD for Asians, 2.9 times for Blacks, 2.9 times for Hispanics and 3.7 times for NHWs (Figure). Minority populations had higher proportion of deaths with diabetes reported as MCD than NHWs (1.7 times higher for Hispanics, 1.5 times higher for Blacks and Asians). Adjusting for age, gender, and race/ethnicity, there were 1.7 more causes per death co-occurred for diabetes decedents compared to decedents who died due to all other causes (95% CI: 1.714, 1.718).

Conclusions: Our findings underscore the importance of a multiple-cause-of-death approach in the analyses for a more comprehensive understanding of the impact of diabetes.

Other authors
See publication
A Bayesian approach to the g-formula

Statistical Methods in Medical Research January 1, 2017
Epidemiologists often wish to estimate quantities that are easy to communicate and correspond to the results of realistic public health interventions. Methods from causal inference can answer these questions. We adopt the language of potential outcomes under Rubin’s original Bayesian framework and show that the parametric g-formula is easily amenable to a Bayesian approach. We show that the frequentist properties of the Bayesian g-formula suggest it improves the accuracy of estimates of causal…

Epidemiologists often wish to estimate quantities that are easy to communicate and correspond to the results of realistic public health interventions. Methods from causal inference can answer these questions. We adopt the language of potential outcomes under Rubin’s original Bayesian framework and show that the parametric g-formula is easily amenable to a Bayesian approach. We show that the frequentist properties of the Bayesian g-formula suggest it improves the accuracy of estimates of causal effects in small samples or when data are sparse. We demonstrate an approach to estimate the effect of environmental tobacco smoke on body mass index among children aged 4–9 years who were enrolled in a longitudinal birth cohort in New York, USA. We provide an algorithm and supply SAS and Stan code that can be adopted to implement this computational approach more generally.

Other authors
See publication
Likelihood of unemployed smokers vs nonsmokers attaining reemployment in a one-year observational study

JAMA Internal Medicine April 11, 2016
Importance: Studies in the United States and Europe have found higher smoking prevalence among unemployed job seekers relative to employed workers. While consistent, the extant epidemiologic investigations of smoking and work status have been cross-sectional, leaving it underdetermined whether tobacco use is a cause or effect of unemployment.

Conclusions and Relevance: To our knowledge, this is the first study to prospectively track reemployment success by smoking status. Smokers had a…

Importance: Studies in the United States and Europe have found higher smoking prevalence among unemployed job seekers relative to employed workers. While consistent, the extant epidemiologic investigations of smoking and work status have been cross-sectional, leaving it underdetermined whether tobacco use is a cause or effect of unemployment.

Conclusions and Relevance: To our knowledge, this is the first study to prospectively track reemployment success by smoking status. Smokers had a lower likelihood of reemployment at 1 year and were paid significantly less than nonsmokers when reemployed. Treatment of tobacco use in unemployment service settings is worth testing for increasing reemployment success and financial well-being.

Other authors
See publication
Longitudinal regression conditioning on continuation

The University of North Carolina at Chapel Hill, ProQuest Dissertations Publishing December 1, 2015

Individuals in a longitudinal study may have missing data for multiple reasons, including intermittent
missed visits or permanent study drop out. Additionally, individuals may experience
a truncating event, such as death, past which the outcomes of interest are no longer meaningful.
Kurland and Heagerty (2005) developed a method to conduct regression conditioning on being
alive (RCA), which constructs inverse-probability weights (IPWs) of the dropout probability
among continuing…

Individuals in a longitudinal study may have missing data for multiple reasons, including intermittent
missed visits or permanent study drop out. Additionally, individuals may experience
a truncating event, such as death, past which the outcomes of interest are no longer meaningful.
Kurland and Heagerty (2005) developed a method to conduct regression conditioning on being
alive (RCA), which constructs inverse-probability weights (IPWs) of the dropout probability
among continuing individuals used to t generalized estimating equations (GEE). RCA has
since been extended to allow for intermittent missingness (IM) of outcomes (Shardell and Miller,
2008). We further extend these methods to simultaneously accommodate dierent mechanisms
for dropout and IM, and call our method regression conditioning on continuation (RCC). RCC is
illustrated using data from a recent study of mother-to-child transmission of HIV to draw inference
on mean infant weights subject to truncation from infant death or HIV infection.

Currently, there is no widely available software for conducting RCA. We present the xtrccipw
command in Stata, which can estimate the dropout IPWs required by RCC if there is no IM.
These IPWs estimated using xtrccipw are then used as weights in a GEE estimator using the
glm command, completing the RCC method. In the absence of truncation, the xtrccipw and
glm commands can also be used in a weighted GEE analysis. The xtrccipw command is demonstrated
by analyzing two example datasets and the original Kurland and Heagerty (2005) data.

See publication
Integrating group counseling, cell phone messaging, and participant-generated songs and dramas into a microcredit program increases Nigerian women's adherence to international breastfeeding recommendations.

The Journal of Nutrition May 8, 2014
In northern Nigeria, interventions are urgently needed to narrow the large gap between international breastfeeding recommendations and actual breastfeeding practices. Studies of integrated microcredit and community health interventions documented success in modifying health behaviors but typically had uncontrolled designs. We conducted a cluster-randomized controlled trial in Bauchi State, Nigeria, with the aim of increasing early breastfeeding initiation and exclusive breastfeeding among…

In northern Nigeria, interventions are urgently needed to narrow the large gap between international breastfeeding recommendations and actual breastfeeding practices. Studies of integrated microcredit and community health interventions documented success in modifying health behaviors but typically had uncontrolled designs. We conducted a cluster-randomized controlled trial in Bauchi State, Nigeria, with the aim of increasing early breastfeeding initiation and exclusive breastfeeding among female microcredit clients. ... In conclusion, a breastfeeding promotion intervention integrated into microcredit increased the likelihood that women adopted recommended breastfeeding practices. This intervention could be scaled up in Nigeria, where local organizations provide microcredit to >500,000 clients. Furthermore, the intervention could be adopted more widely given that >150 million women, many of childbearing age, are involved in microfinance globally. This trial was registered at clinicaltrials.gov as NCT01352351.

Other authors
See publication
Plasma and breast-milk selenium in HIV-infected Malawian mothers are positively associated with infant selenium status but are not associated with maternal supplementation: results of the Breastfeeding, Antiretrovirals, and Nutrition study.

The American Journal of Clinical Nutrition April 1, 2014
Background: Selenium is found in soils and is essential for human antioxidant defense and immune function. In Malawi, low soil selenium and dietary intakes coupled with low plasma selenium concentrations in HIV infection could have negative consequences for the health of HIV-infected mothers and their exclusively breastfed infants.

Objective: We tested the effects of lipid-based nutrient supplements (LNS) that contained 1.3 times the Recommended Dietary Allowance of sodium selenite and…

Background: Selenium is found in soils and is essential for human antioxidant defense and immune function. In Malawi, low soil selenium and dietary intakes coupled with low plasma selenium concentrations in HIV infection could have negative consequences for the health of HIV-infected mothers and their exclusively breastfed infants.

Objective: We tested the effects of lipid-based nutrient supplements (LNS) that contained 1.3 times the Recommended Dietary Allowance of sodium selenite and antiretroviral drugs (ARV) on maternal plasma and breast-milk selenium concentrations.

...

Conclusions: Selenite supplementation of HIV-infected Malawian women was not associated with a change in their plasma or breast-milk selenium concentrations. Future research should examine effects of more readily incorporated forms of selenium (ie, selenomethionine) in HIV-infected breastfeeding women. This trial was registered at clinicaltrials.gov as NCT00164736.

Other authors
See publication
Changes in soluble transferrin receptor and hemoglobin concentrations in Malawian mothers are associated with those values in their exclusively breastfed, HIV-exposed infants.

The Journal of Nutrition March 1, 2014
Infant iron status at birth is influenced by maternal iron status during pregnancy; however, there are limited data on the extent to which maternal iron status is associated with infant iron status during exclusive breastfeeding. We evaluated how maternal and infant hemoglobin and iron status [soluble transferrin receptors (TfR) and ferritin] were related during exclusive breastfeeding in HIV-infected women and their infants. The Breastfeeding, Antiretrovirals, and Nutrition Study was a…

Infant iron status at birth is influenced by maternal iron status during pregnancy; however, there are limited data on the extent to which maternal iron status is associated with infant iron status during exclusive breastfeeding. We evaluated how maternal and infant hemoglobin and iron status [soluble transferrin receptors (TfR) and ferritin] were related during exclusive breastfeeding in HIV-infected women and their infants. The Breastfeeding, Antiretrovirals, and Nutrition Study was a randomized controlled trial in Lilongwe, Malawi, in which HIV-infected women were assigned with a 2 × 3 factorial design to a lipid-based nutrient supplement (LNS), or no LNS, and maternal, infant, or no antiretroviral drug, and followed for 24 wk. Longitudinal models were used to relate postpartum maternal hemoglobin (n = 1926) to concurrently measured infant hemoglobin, adjusting for initial infant hemoglobin values. In a subsample, change in infant iron status (hemoglobin, log ferritin, log TfR) between 2 (n = 352) or 6 wk (n = 167) and 24 wk (n = 519) was regressed on corresponding change in the maternal indicator, adjusting for 2 or 6 wk values. A 1 g/L higher maternal hemoglobin at 12, 18, and 24 wk was associated with a 0.06 g/L (P = 0.01), 0.10 g/L (P < 0.001), and 0.06 g/L (P = 0.01), respectively, higher infant hemoglobin. In the subsample, a reduction in maternal log TfR and an increase in hemoglobin from initial measurement to 24 wk were associated with the same pattern in infant values (log TfR β = −0.18 mg/L, P < 0.001; hemoglobin β = 0.13 g/L, P = 0.01). Given the observed influence of maternal and initial infant values, optimizing maternal iron status in pregnancy and postpartum is important to protect infant iron status. This trial was registered at clinicaltrials.gov as NCT00164736.

Other authors
See publication

Courses

Advanced Econometrics

ECON 870
Advanced Linear Models I

BIOS 762
Advanced Statistical Methods in Biometric and Public Health

BIOS 758
Advanced Survey Sampling Methods

BIOS 764
Analysis of Categorical Data

BIOS 665
Bayesian Statistics

BIOS 779
Causal Inference in Biomedical Research (audited)

BIOS 776
Causal Inference in Public Health

MCH 759
Data Management in Clinical and Public Health Research

BIOS 613
Demographic Techniques I

BIOS 670
Design and Analysis of Clinical Trials

BIOS 752
Group Lessons in Voice

MUSC 111
Health Effects of Environmental Agents

ENVR 430
International and Comparative Health Systems

HPM 660
Introductory Survivorship Analysis

BIOS 680
Longitudinal Data Analysis

BIOS 767
Practice in Statistical Consulting

BIOS 842
Principles of Epidemiology

EPID 600
Principles of Statistical Consulting

BIOS 841
Research Ethics

GRAD 721
Social and Behavioral Sciences in Public Health

HBHE 600
Structural Equations with Latent Variables

SOCI 717
Systematic Review and Meta-Analysis

EPID 730
Systems Biology in Environmental Health

ENVR 890
Training in Statistical Teaching in the Health Sciences

BIOS 850
Biostatistical Methods: Survival Analysis and Causality

PB HLTH C240B
Censored Longitudinal Data and Causality

PB HLTH C246A
Introduction to Analysis

MATH 104
Linear Algebra & Differential Equations

MATH 54
Multivariable Calculus

MATH 53
Computing for Data Science (audited) (http://statweb.stanford.edu/~naras/stat290/Stat290_Website/Stat_290.html)

STATS 290

Projects

N-of-5 Minutes

Nov 2022
Five minute chats with experts from all corners of science about statistics that can be applied to a single person. All scientists and non-scientists are welcomed! This podcast is hosted by Stats-of-1 (statsof1.org), a blog on a mission to improve personalized health through digital individualized statistical designs or methods.

Other creators
Stats-of-1

Feb 2020
Stats-of-1 is a newsletter that seeks to improve personalized health by promoting the expanded use of individual-focused, quantitative idiography (QI) statistical designs and methods in health and medicine. Collectively, these QI approaches define the field of esametry (derived from “isa”, the Tagalog Filipino word for “one”).

Inspired by the intra-individual or within-individual approaches of n-of-1 trials, single-case experimental designs, and single-subject research, we are building a…

Stats-of-1 is a newsletter that seeks to improve personalized health by promoting the expanded use of individual-focused, quantitative idiography (QI) statistical designs and methods in health and medicine. Collectively, these QI approaches define the field of esametry (derived from “isa”, the Tagalog Filipino word for “one”).

Inspired by the intra-individual or within-individual approaches of n-of-1 trials, single-case experimental designs, and single-subject research, we are building a community of statistics-savvy pioneers who use modern tools of digital health (e.g., wearables, sensors) to create or adapt statistical techniques to study a single individual’s recurring trends. Visit our About page to learn more.

Other creators
See project
Causes and Associations in Single-Individual Analysis (CASIA) [pronounced: ka-sha]

Sep 2016 - 2020
Goal: To establish the feasibility of applying causal inference methods to a single individual's data to improve causal discovery for personalized/precision health. To develop observational and experimental study design and analysis methods where needed.

Other creators
See project
DISCOVeR, Stanford University School of Medicine, Palo Alto Medical Foundation

Oct 2015 - Oct 2018
http://med.stanford.edu/discover.html
http://www.pamf.org/discover/causes.html

Other creators
See project
The Breastfeeding, Antiretrovirals, and Nutrition Study: Malawi Mothers and Infants (BAN: MaMi)

May 2010 - Sep 2015
http://www.cpc.unc.edu/news/features/improving-health-and-survival-in-malawi
http://hpdp.unc.edu/research/projects/ban/

Other creators
See project
A Statistical New World

Jun 2011 - Jun 2011
My talented colleagues and friends produced the vision and lyrics for this educational and entertaining music video based on Disney's "A Whole New World" on how statistics educates and empowers. I was fortunate to be able to work with them, and to lend my theatrical, vocal, and music-production expertise to their project.

Other creators
See project
Non-response in Wave IV of the Add Health Study

Jun 2010 - Jul 2010
Non-response is a potential threat to the accuracy of estimates obtained from sample surveys and can be particularly difficult to avoid in longitudinal studies. The objective of this report is to investigate non-response and consequent bias in estimates for Wave IV of the National Longitudinal Study of Adolescent Health (Add Health). The Survey Research Unit at the University of North Carolina at Chapel Hill previously analyzed the non-response rates for the first three waves of Add Health. As…

Non-response is a potential threat to the accuracy of estimates obtained from sample surveys and can be particularly difficult to avoid in longitudinal studies. The objective of this report is to investigate non-response and consequent bias in estimates for Wave IV of the National Longitudinal Study of Adolescent Health (Add Health). The Survey Research Unit at the University of North Carolina at Chapel Hill previously analyzed the non-response rates for the first three waves of Add Health. As shown in Chantala, Kalsbeek and Andraca, 2005, the total bias in Waves I, II, and III for 13 measures of health and risk behaviors rarely exceed 1%, which is small relative to the 20% to 80% prevalence rates for most of these measures. Results are similar for Wave IV.

In this paper, first, we outline the Wave IV sampling design and results of the field work. Second, we characterize the non-response rates overall and stratified by a number of demographic variables. Next, we use data on the health risk measures reported by Wave IV responders and non-responders during their Wave I In-home interview to estimate total and relative bias due to non-response in Wave IV. We conclude with a discussion of Wave IV bias due to non-response.

Other creators
See project

Honors & Awards

Young Investigator Award, 2018 Sage Assembly: Algorithms and the Role of the Individual

Sage Bionetworks

Feb 2018
Improving personalized medicine through n-of-1 causal inference and predictive modeling

Stanford Center for Clinical and Translational Research and Education (Spectrum), Population Health Sciences

May 2017

"Improving personalized medicine through n-of-1 causal inference and predictive modeling." 1 May 2017 - 30 Jun 2018. $26,000.

http://med.stanford.edu/phs/phs-grants.html

Languages

Tagalog

Native or bilingual proficiency
Spanish

Limited working proficiency
English

Native or bilingual proficiency
SAS

Full professional proficiency
Stata

Full professional proficiency
R

Native or bilingual proficiency
SQL

Professional working proficiency
Filipino

Native or bilingual proficiency
Python

Limited working proficiency

Organizations

American Statistical Association: Justice, Equity, Diversity and Inclusion Outreach Group

Member, PDC Chair, JEDI Chair-Elect

Jan 2021 - Present

http://www.datascijedi.org/
American Statistical Association, San Francisco Bay Area Chapter

member

Oct 2015 - Present
American Statistical Association

member

Sep 2007 - Present
American Statistical Association

-

Aug 2007 - Present
American Statistical Association

-

Jan 2007 - Present
American Statistical Association, North Carolina Chapter

member

2007 - Present
Risley Residential College for the Performing Arts

tech crew, actor, tech director

Sep 1996 - May 2002

Recommendations received

LinkedIn User

“I had the pleasure of working with Eric at Evidation, and I can't say enough positive things about his work and his energy. When it comes to his craft, Eric is a clear thought leader who has deep subject matter expertise. Not only does this allow him to lead projects and strategy, but it centers him as a key collaborator and consultant to so many teams. For example, I personally had the opportunity to tag team with Eric on a study where I required a deeper understanding of the statistical analysis plan to inform the recruitment planning I was conducting. He was so easy to approach with questions, acted as a true thought partner, and ultimately supported me in crafting a plan that was aligned with overall study goals. However, his wealth of knowledge is somehow not what is so impressive about him. What stands out more is that in addition to all the technical knowledge and accolades that he's accumulated throughout his career, he's one of the most humble, thoughtful, and empathic people you'll ever meet. He cares about the people he works with and it shows in his every interaction. He's also a DEI advocate who embeds his values into his work and relationships, making project deliverables and company culture stronger. Overall, I can't say enough about how much I loved working with Eric! He's an asset to any team and company.”
Volodymyr Serhiyenko, PhD

“Eric Jay possesses a profound statistical and scientific knowledge. I always rely on his opinion about different methodologies and value his input into modeling discussions. Eric Jay is a great presenter of statistical results. He can clearly highlight the important pieces and create examples that people without statistical background can easily relate to in everyday life. Eric Jay is also a self-starter and work well as an independent researcher. He rarely needs additional input and can create a lot of rigorous output in form of study protocols, proof of concepts, paper drafts, and Confluence pages. Finally, Eric Jay is a great teammate. He communicates very well with other team members. Eric Jay is always positive during the discussions and gives a great feedback that vastly improves the outcomes of the meeting. ”
Iman Haji

“Eric J is a "true scientist" who focuses on high impact real-world applications. One of the highlights of my work at Clarify Health was our brainstorming sessions with Eric J, where he always found subtle scientific and statistical explanations for seemingly unsolvable problems. He is the brain who thrives by solving complex puzzles and I am glad that through my interactions with him I got the chance to broaden my horizons in statistical analysis and data science. On the personal level, he is a calm friendly team-player who always smiles :)”

3 people have recommended Eric J. Join now to view

More activity by Eric J.

A bit of personal news: I will be stepping down as Dean of the Boston University School of Public Health at the end of the year…

Liked by Eric J. Daza, DrPH, MPS

NEVER STAND STILL || Always happy to be back here! Thanks for the gift of dual masters in public health economics and management… And now, for this…

Liked by Eric J. Daza, DrPH, MPS

This post isn't about me and my work - it's about my wife. After an eighteen year hiatus, she's trying to re-enter the workforce in public health…

Liked by Eric J. Daza, DrPH, MPS

🌟 Exciting Opportunity Alert! 🌟 Bescy is growing quickly, and we're seeking a visionary leader like you to join us as our Vice President of…

Liked by Eric J. Daza, DrPH, MPS

QUESTION FOR SOLO ENTREPRENEURS When it comes to running a business, I tend to get stuck in my ways and don't try new tools nearly enough. Most if…

Liked by Eric J. Daza, DrPH, MPS

Boehringer Ingelheim partners with Walgreens in clinical trials for obesity treatment: 🧪The pair have partnered to enhance clinical trial…

Liked by Eric J. Daza, DrPH, MPS

#ICER is debuting a clinical diversity framework…and found no consistent measures of diversity (no surprise)…#askpatients #patientengagement…

Liked by Eric J. Daza, DrPH, MPS

It's official now. As some of you may know, I have been selected as a Fellow of the American Statistical Association. In my case, the following…

Liked by Eric J. Daza, DrPH, MPS

View Eric J.’s full profile

See who you know in common
Get introduced
Contact Eric J. directly

Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Add new skills with these courses

See all courses

Eric J. Daza, DrPH, MPS

Menlo Park, California, United States 7K followers 500+ connections

See your mutual connections View mutual connections with Eric J. Sign in Welcome back Email or phone Password Show Forgot password? Sign in or New to LinkedIn? Join now or New to LinkedIn? Join now

About

Contributions

Activity

Recent Stats+Stories #Podcast - American Statistical Association - ASA Noel Cressie talks about the WOllongong Methodology for Bayesian Assimilation…

Liked by Eric J. Daza, DrPH, MPS

Evidation Shares Real-World Insights on Weight Management & Healthcare Access at ISPOR 2024 This week, our research team is sharing findings from…

Liked by Eric J. Daza, DrPH, MPS

One of the most overlooked behaviors that promotes optimal health is adequate sleep. See NIH study here: https://lnkd.in/e9-p5kYe CDC has just…

Liked by Eric J. Daza, DrPH, MPS

Experience

Keep AI

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

Education

-

-

-

-

-

-

-

Volunteer Experience

Chairperson

American Statistical Association - ASA

Volunteer

Braver Angels

Licenses & Certifications

Publications

United States Patent and Trademark Office February 8, 2024

JMIR Form Res November 30, 2023

Digital Health August 24, 2022

arXiv August 1, 2022

JAMA Network Open May 12, 2022

Frontiers SA April 7, 2021

American Journal of Preventive Medicine January 15, 2020

Healthcare, Multidisciplinary Digital Publishing Institute December 25, 2019

arXiv January 22, 2019

Methods of Information in Medicine Feb 2018

Thyroid cancer mortality higher in Filipinos in United States: an analysis using national mortality records from 2003-2012.

Cancer Sep 2017

The Stata Journal June 28, 2017

Science: NextGen voices: Advocacy in brief April 7, 2017

American Heart Association, Inc. March 7, 2017

Statistical Methods in Medical Research January 1, 2017

JAMA Internal Medicine April 11, 2016

The University of North Carolina at Chapel Hill, ProQuest Dissertations Publishing December 1, 2015

The Journal of Nutrition May 8, 2014

The American Journal of Clinical Nutrition April 1, 2014

The Journal of Nutrition March 1, 2014

Courses

Advanced Econometrics

ECON 870

Advanced Linear Models I

BIOS 762

Advanced Statistical Methods in Biometric and Public Health

BIOS 758

Advanced Survey Sampling Methods

BIOS 764

Analysis of Categorical Data

BIOS 665

Bayesian Statistics

BIOS 779

Causal Inference in Biomedical Research (audited)

Menlo Park, California, United States
7K followers 500+ connections

View mutual connections with Eric J.

Welcome back

Email or phone

Password

Forgot password?

or

New to LinkedIn? Join now

or

New to LinkedIn? Join now