“I had the pleasure of working with Eric at Evidation, and I can't say enough positive things about his work and his energy. When it comes to his craft, Eric is a clear thought leader who has deep subject matter expertise. Not only does this allow him to lead projects and strategy, but it centers him as a key collaborator and consultant to so many teams. For example, I personally had the opportunity to tag team with Eric on a study where I required a deeper understanding of the statistical analysis plan to inform the recruitment planning I was conducting. He was so easy to approach with questions, acted as a true thought partner, and ultimately supported me in crafting a plan that was aligned with overall study goals. However, his wealth of knowledge is somehow not what is so impressive about him. What stands out more is that in addition to all the technical knowledge and accolades that he's accumulated throughout his career, he's one of the most humble, thoughtful, and empathic people you'll ever meet. He cares about the people he works with and it shows in his every interaction. He's also a DEI advocate who embeds his values into his work and relationships, making project deliverables and company culture stronger. Overall, I can't say enough about how much I loved working with Eric! He's an asset to any team and company.”
Eric J. Daza, DrPH, MPS
Menlo Park, California, United States
7K followers
500+ connections
About
I help you use your own data to learn about yourself. How? Thorough statistical and…
Contributions
-
What are the statistical analysis tools that can be used to analyze survey data?
Intensive longitudinal data approaches can be appropriate for survey outcomes collected very frequently. These include time-varying coefficient models and multilevel time series models. See: https://statsof1.org/resources/#aggregated-n-of-1-analyses —in particular, Walls and Shafer (2006).
-
What are the most effective ways to transition to a career in statistics?
'While “being a data scientist” requires knowing core statistical principles and having basic statistical skills, “being a statistician” in data science can be a liability!' More from my 20+ years as a biostatistian-turned-data-scientist here: https://towardsdatascience.com/why-you-should-think-of-the-enterprise-of-data-science-more-like-a-business-less-like-science-45227c65c09
-
What are the most effective ways to transition to a career in statistics?
For folks with doctoral training in particular, it might help to know that I rely on and apply these two core technical concepts all the time: 1. (straight-up statistics) the Law of Total Expectation E(Y) = E{E(Y|X)} 2. (statistics and machine learning connections) the implicit prediction-expectation approximate equivalence E(Y|X) ≈ Y ~ f(X) (i.e., conditional expectation ≈ prediction function)
-
What distinguishes the job responsibilities of a statistician from those of a data analyst?
"Academic statisticians and clinical-trials biostatisticians rarely (if ever) work as co-investigators; we’re not trained to do so. My academic training tells me that making decisions as a co-investigator is overstepping, maybe even disrespectful (i.e., by sending the message that I don’t trust the scientist, even though it’s clearly not my field of expertise). It’s not a typical responsibility for professional statisticians." https://towardsdatascience.com/why-you-should-think-of-the-enterprise-of-data-science-more-like-a-business-less-like-science-45227c65c09
Activity
-
Recent Stats+Stories #Podcast - American Statistical Association - ASA Noel Cressie talks about the WOllongong Methodology for Bayesian Assimilation…
Recent Stats+Stories #Podcast - American Statistical Association - ASA Noel Cressie talks about the WOllongong Methodology for Bayesian Assimilation…
Liked by Eric J. Daza, DrPH, MPS
-
Evidation Shares Real-World Insights on Weight Management & Healthcare Access at ISPOR 2024 This week, our research team is sharing findings from…
Evidation Shares Real-World Insights on Weight Management & Healthcare Access at ISPOR 2024 This week, our research team is sharing findings from…
Liked by Eric J. Daza, DrPH, MPS
-
One of the most overlooked behaviors that promotes optimal health is adequate sleep. See NIH study here: https://lnkd.in/e9-p5kYe CDC has just…
One of the most overlooked behaviors that promotes optimal health is adequate sleep. See NIH study here: https://lnkd.in/e9-p5kYe CDC has just…
Liked by Eric J. Daza, DrPH, MPS
Experience
-
Keep AI
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Education
Volunteer Experience
-
Chairperson
Chairperson
American Statistical Association - ASA
-
Volunteer
Volunteer
Braver Angels
Licenses & Certifications
Publications
-
Methods and Systems for Generating Personalized Treatments via Causal Inference
United States Patent and Trademark Office
Provided herein are systems, methods, computer-readable media, and techniques for generating a personalized recommended intervention for a subject based on causal inference, including: obtaining a first set of time series data and a second set of time series data, the first set of time series data relating to a first variable indicative of a health behavior of the subject, and the second set of time series data relating to a second variable indicative of a health condition of the subject;…
Provided herein are systems, methods, computer-readable media, and techniques for generating a personalized recommended intervention for a subject based on causal inference, including: obtaining a first set of time series data and a second set of time series data, the first set of time series data relating to a first variable indicative of a health behavior of the subject, and the second set of time series data relating to a second variable indicative of a health condition of the subject; determining a causal effect of the first variable on the second variable by estimating an average treatment effect, wherein the average treatment effect is estimated by processing the first set of time series data and the second set of time series data using a model-twin randomization method; and generating a personalized treatment or intervention recommendation for the subject to change the health condition based on the causal effect.
-
Patient and Physician Perspectives on the Use of a Connected Ecosystem for Diabetes Management: International Cross-Sectional Observational Study
JMIR Form Res
Collaboration between people with type 2 diabetes (T2DM) and their health care teams is important for optimal control of the disease and outcomes. Digital technologies could potentially tie together several health care-related devices and platforms into connected ecosystems (CES), but attitudes about CES are unknown.
We surveyed convenience samples of patients and physicians to better understand which patient characteristics are associated with higher likelihoods of (1) participating in…Collaboration between people with type 2 diabetes (T2DM) and their health care teams is important for optimal control of the disease and outcomes. Digital technologies could potentially tie together several health care-related devices and platforms into connected ecosystems (CES), but attitudes about CES are unknown.
We surveyed convenience samples of patients and physicians to better understand which patient characteristics are associated with higher likelihoods of (1) participating in a potential CES program, as self-reported by patients with T2DM and (2) clinical benefit from participation in a potential CES program, as reported by physicians.Other authorsSee publication -
What possibly affects nighttime heart rate? Conclusions from N-of-1 observational data
Digital Health
Results
We found that physical activity can increase the nighttime heart rate amplitude, whereas there were no strong conclusions about its suggested effect on total sleep time. Self-reported states such as exercise, yoga, and stress were associated with increased (for the first two) and decreased (last one) average nighttime heart rate.
Conclusions
This study implemented the MoTR method evaluating the suggested effects of daily stressors on nighttime heart rate, sleep…Results
We found that physical activity can increase the nighttime heart rate amplitude, whereas there were no strong conclusions about its suggested effect on total sleep time. Self-reported states such as exercise, yoga, and stress were associated with increased (for the first two) and decreased (last one) average nighttime heart rate.
Conclusions
This study implemented the MoTR method evaluating the suggested effects of daily stressors on nighttime heart rate, sleep time, and physical activity in an individualized way: via the N-of-1 approach. A Python implementation of MoTR is freely available.Other authorsSee publication -
Model-Twin Randomization (MoTR): A Monte Carlo Method for Estimating the Within-Individual Average Treatment Effect Using Wearable Sensors
arXiv
Temporally dense single-person "small data" have become widely available thanks to mobile apps and wearable sensors. Many caregivers and self-trackers want to use these data to help a specific person change their behavior to achieve desired health outcomes. Ideally, this involves discerning possible causes from correlations using that person's own observational time series data. In this paper, we estimate within-individual average treatment effects of physical activity on sleep duration, and…
Temporally dense single-person "small data" have become widely available thanks to mobile apps and wearable sensors. Many caregivers and self-trackers want to use these data to help a specific person change their behavior to achieve desired health outcomes. Ideally, this involves discerning possible causes from correlations using that person's own observational time series data. In this paper, we estimate within-individual average treatment effects of physical activity on sleep duration, and vice-versa. We introduce the model twin randomization (MoTR; "motor") method for analyzing an individual's intensive longitudinal data. Formally, MoTR is an application of the g-formula (i.e., standardization, back-door adjustment) under serial interference. It estimates stable recurring effects, as is done in n-of-1 trials and single case experimental designs. We compare our approach to standard methods (with possible confounding) to show how to use causal inference to make better personalized recommendations for health behavior change, and analyze 222 days of Fitbit sleep and steps data for one of the authors.
Other authorsSee publication -
Estimating the Burden of Influenza-like Illness on Daily Activity at the Population Scale Using Commercial Wearable Sensors
JAMA Network Open
Question How can the true burden of influenza-like illnesses (ILIs) be estimated given that most cases of ILIs are mild and go undocumented ?
Findings This cohort study of 15 122 adults who reported ILI symptoms and had data from wearable sensors at symptom onset found an overall reduction in mobility equivalent to 15% of the active US population becoming completely immobilized for 1 day. More than 60% of this reduction occurred among persons who had sought no medical…Question How can the true burden of influenza-like illnesses (ILIs) be estimated given that most cases of ILIs are mild and go undocumented ?
Findings This cohort study of 15 122 adults who reported ILI symptoms and had data from wearable sensors at symptom onset found an overall reduction in mobility equivalent to 15% of the active US population becoming completely immobilized for 1 day. More than 60% of this reduction occurred among persons who had sought no medical care.
Meaning This study suggests that the burden of ILIs is much greater than had previously been understood.Other authorsSee publication -
Creating Evidence from Real World Patient Digital Data
Frontiers SA
N-of-1 randomized controlled trials (RCTs) provide an opportunity to evaluate individual patient response to interventions, by randomly allocating different time periods within an individual to repeated intervention and control conditions and comparing responses. N-of-1 observational studies involve the repeated measurement of an outcome (e.g. pain) in a patient over time, but with no intervention implemented, in order to draw conclusions about naturally-occurring patterns and predictors of…
N-of-1 randomized controlled trials (RCTs) provide an opportunity to evaluate individual patient response to interventions, by randomly allocating different time periods within an individual to repeated intervention and control conditions and comparing responses. N-of-1 observational studies involve the repeated measurement of an outcome (e.g. pain) in a patient over time, but with no intervention implemented, in order to draw conclusions about naturally-occurring patterns and predictors of outcomes over time. Both N-of-1 RCTs and observational studies can have a ‘self-study’ design, where an individual conducts the study on themselves, to answer research questions they have generated themselves. N-of-1 RCTs and observational studies provide individualized findings that can be aggregated to produce results equivalent to those found in traditional group-based RCTs and population-level epidemiological studies, respectively, but requiring fewer patients for the same power.
N-of-1 RCTs and observational studies are well-suited to complement, strengthen, and generate advances in precision medicine, patient-centred healthcare, and personalised health. Since 2015, the number of N-of-1 articles has doubled annually.
Similarly, digital health is an exploding field, with over 1,000 studies registered on clinicaltrials.gov.
Digital health, and digital therapeutics in particular, complement N-of-1 RCTs and observational studies by providing relevant individualized health data from, for example, worn sensors, implants, regular lab assays, or -omics sequencing. Such data can be compared to population-health databases to target a patient’s strongest possible treatment option (as in cancer-risk studies) and, in turn, inform the design of an N-of-1 RCT to evaluate it.
Digital health data can also be continuously monitored during the study itself and used to help tailor a treatment to the needs and preferences of patients in real time.Other authorsSee publication -
Inequity in California's Smokefree Workplace Laws: A Legal Epidemiologic Analysis of Loophole Closures
American Journal of Preventive Medicine
Introduction
California's landmark 1994 Smokefree Workplace Act contained numerous exemptions, or loopholes, believed to contribute to inequities in smokefree air protections among low-income communities and communities of color (e.g., permitting smoking in warehouses, hotel common areas). Cities/counties were not prevented from adopting stronger laws. This study coded municipal laws and state law changes (in 2015–2016) for loophole closures and determined their effects in reducing…Introduction
California's landmark 1994 Smokefree Workplace Act contained numerous exemptions, or loopholes, believed to contribute to inequities in smokefree air protections among low-income communities and communities of color (e.g., permitting smoking in warehouses, hotel common areas). Cities/counties were not prevented from adopting stronger laws. This study coded municipal laws and state law changes (in 2015–2016) for loophole closures and determined their effects in reducing inequities in smokefree workplace protections.
Methods
Public health attorneys reviewed current laws for 536 of California's 539 cities and counties from January 2017 to May 2018 and coded for 19 loophole closures identified from legislative actions (inter-rater reliability, 87%). The local policy data were linked with population demographics from intercensal estimates (2012–2016) and adult smoking prevalence (2014). The analyses were cross-sectional and conducted in February–June 2019.
Results
Between 1994 and 2018, jurisdictions closed 6.09 loopholes on average (SD=5.28). Urban jurisdictions closed more loopholes than rural jurisdictions (mean=6.40 vs 3.94, p<0.001), and loophole closure scores correlated positively with population size, median household income, and percentage white, non-Hispanic residents (p<0.001 for all). Population demographics and the loophole closure score explained 43% of the variance in jurisdictions’ adult smoking prevalence. State law changes in 2015–2016 increased loophole closure scores and decreased jurisdiction variation (mean=9.74, SD=3.56); closed more loopholes in rural versus urban jurisdictions (meangain=4.44 vs 3.72, p=0.002); and in less populated, less affluent jurisdictions, with greater racial/ethnic diversity, and higher smoking prevalence (p<0.001 for all).
Conclusions
Although jurisdictions made important progress in closing loopholes in smokefree air law, state law changes achieved greater reductions in inequities in policy coverage.Other authorsSee publication -
Effects of Sleep Deprivation on Blood Glucose, Food Cravings, and Affect in a Non-Diabetic: An N-of-1 Randomized Pilot Study
Healthcare, Multidisciplinary Digital Publishing Institute
Abstract: Sleep deprivation is a prevalent and rising health concern, one with known effects on blood glucose (BG) levels, mood, and calorie consumption. However, the mechanisms by which sleep deprivation affects calorie consumption (e.g., measured via self-reported types craved food) are unclear, and may be highly idiographic (i.e., individual specific). Single-case or “n-of-1” randomized trials (N1RT) are useful in exploring such effects by exposing each subject to both sleep deprivation and…
Abstract: Sleep deprivation is a prevalent and rising health concern, one with known effects on blood glucose (BG) levels, mood, and calorie consumption. However, the mechanisms by which sleep deprivation affects calorie consumption (e.g., measured via self-reported types craved food) are unclear, and may be highly idiographic (i.e., individual specific). Single-case or “n-of-1” randomized trials (N1RT) are useful in exploring such effects by exposing each subject to both sleep deprivation and baseline conditions, thereby characterizing effects specific to that individual. We had two objectives: (1) To test and generate individual-specific N1RT hypotheses of the effects of sleep deprivation on next-day BG level, mood, and food cravings in two non-diabetic individuals; (2) To refine and guide a future n-of-1 study design for testing and generating such idiographic hypotheses for personalized management of sleep behavior in particular, and for chronic health conditions more broadly. We initially did not find evidence for an idiographic effect of sleep deprivation, but better-refined post hoc findings indicate that sleep deprivation may have increased BG fluctuations, cravings, and negative emotions. We also introduce an application of mixed-effects models and pancit plots to assess idiographic effects over time.
Other authorsSee publication -
Person as Population: A Longitudinal View of Single-Subject Causal Inference for Analyzing Self-Tracked Health Data
arXiv
Single-subject health data are becoming increasingly available thanks to advances in self-tracking technology (e.g., mobile devices, apps, sensors, implants). Many users and health caregivers seek to use such observational time series data to recommend changing health practices in order to achieve desired health outcomes. However, there are few available causal inference approaches that are flexible enough to analyze such idiographic data. We develop a recently introduced framework, and…
Single-subject health data are becoming increasingly available thanks to advances in self-tracking technology (e.g., mobile devices, apps, sensors, implants). Many users and health caregivers seek to use such observational time series data to recommend changing health practices in order to achieve desired health outcomes. However, there are few available causal inference approaches that are flexible enough to analyze such idiographic data. We develop a recently introduced framework, and implement a flexible random-forests g-formula approach to estimating a recurring individualized effect called the "average period treatment effect". In the process, we argue that our approach essentially resembles that of a longitudinal study by partitioning a single time series into periods taking on binary treatment levels. We analyze six years of the author's own self-tracked physical activity and weight data to demonstrate our approach, and compare the results of our analysis to one that does not properly account for confounding.
-
Causal analysis of self-tracked time series data using a counterfactual framework for n-of-1 trials.
Methods of Information in Medicine
Many of an individual's historically recorded personal measurements vary over time, thereby forming a time series (e.g., wearable-device data, self-tracked fitness or nutrition measurements, clinical/chronic conditions). Statistical analyses of such n-of-1 (i.e., single-subject) observational studies (N1OSs) can be used to discover possible cause-effect relationships to then self-test in an n-of-1 randomized trial (N1RT). However, a principled way of determining how and when to interpret an…
Many of an individual's historically recorded personal measurements vary over time, thereby forming a time series (e.g., wearable-device data, self-tracked fitness or nutrition measurements, clinical/chronic conditions). Statistical analyses of such n-of-1 (i.e., single-subject) observational studies (N1OSs) can be used to discover possible cause-effect relationships to then self-test in an n-of-1 randomized trial (N1RT). However, a principled way of determining how and when to interpret an N1OS association as a causal effect (e.g., as if randomization had occurred) is needed. Our goal in this paper is to help bridge the methodological gap between risk-factor discovery and N1RT testing by introducing a basic counterfactual framework for N1OS design and personalized causal analysis. We introduce and characterize what we call the average period treatment effect (APTE), i.e., the estimand of interest in an N1RT, and build an analytical framework around it that can accommodate autocorrelation and time trends in the outcome, effect carryover from previous treatment periods, and slow onset or decay of the effect. The APTE is loosely defined as a contrast (e.g., difference, ratio) of average outcomes the individual can experience under different treatment levels at a given treatment period. To illustrate the utility of our framework for APTE discovery and estimation, two common causal inference methods are specified within the N1OS context. We then apply the framework and methods to search for estimable and interpretable APTEs using six years of the author's self-tracked weight and exercise data, and report both the preliminary findings and the challenges we faced in conducting N1OS causal discovery. Causal analysis of an individual's time series data can be facilitated by an N1RT counterfactual framework. For inference to be valid, the veracity of certain key assumptions must be assessed critically, and the hypothesized causal models must be interpretable and meaningful.
-
Thyroid cancer mortality higher in Filipinos in United States: an analysis using national mortality records from 2003-2012.
Cancer
Background: Well-differentiated thyroid carcinoma has a favorable prognosis, but patients with multiple recurrences have drastically lower survival. Filipinos in the US are known to have high thyroid cancer incidence and recurrence rates. It is unknown whether Filipinos also have higher thyroid cancer mortality rates.
Methods: We studied thyroid cancer mortality in Filipino, non-Filipino Asian (NFA), and non-Hispanic White (NHW) adults using US death records (2003-2012) and US Census data…Background: Well-differentiated thyroid carcinoma has a favorable prognosis, but patients with multiple recurrences have drastically lower survival. Filipinos in the US are known to have high thyroid cancer incidence and recurrence rates. It is unknown whether Filipinos also have higher thyroid cancer mortality rates.
Methods: We studied thyroid cancer mortality in Filipino, non-Filipino Asian (NFA), and non-Hispanic White (NHW) adults using US death records (2003-2012) and US Census data. Age-adjusted mortality rates (AMRs) and proportional mortality ratios (PMRs) were calculated. Gender, nativity status, age at death, and educational attainment were also examined.
Results: We examined 19,940,952 deaths. AMR due to thyroid cancer was highest in Filipinos (1.72 deaths per 100,000, 95% CI 1.51-1.95) compared to NFAs (1.03 per 100,000, 95% CI 0.95-1.12) and NHWs (1.17 per 100,000, 95% CI 1.16-1.18). Compared to NHWs, higher proportionate mortality was observed in Filipino women (3-5 times higher) across all age groups, and Filipino men had 2-3 times higher PMR in the subgroup over the age of 55. Filipinos that completed higher education had notably higher PMR (5.0) than their counterparts who had not (3.5).
Conclusions: Negative prognostic factors for thyroid cancer traditionally include “age greater than 45 years” and “male gender.” We demonstrate that Filipinos die of thyroid cancer at higher rates than NFAs and NHWs of similar ages. Highly-educated Filipinos and Filipino women may be especially at risk for poor thyroid cancer outcomes. Filipino ethnicity should be factored into clinical decision-making in the management of thyroid cancer.Other authors -
Estimating inverse-probability weights for longitudinal data with dropout or truncation: The xtrccipw command
The Stata Journal
Abstract. Individuals may drop out of a longitudinal study, rendering their outcomes unobserved but still well defined. However, they may also undergo truncation (for example, death), beyond which their outcomes are no longer meaningful. Kurland and Heagerty (2005, Biostatistics 6: 241–258) developed a method to conduct regression conditioning on nontruncation, that is, regression conditioning on continuation (RCC), for longitudinal outcomes that are monotonically missing at random (for…
Abstract. Individuals may drop out of a longitudinal study, rendering their outcomes unobserved but still well defined. However, they may also undergo truncation (for example, death), beyond which their outcomes are no longer meaningful. Kurland and Heagerty (2005, Biostatistics 6: 241–258) developed a method to conduct regression conditioning on nontruncation, that is, regression conditioning on continuation (RCC), for longitudinal outcomes that are monotonically missing at random (for example, because of dropout). This method first estimates the probability of dropout among continuing individuals to construct inverse-probability weights (IPWs), then fits generalized estimating equations (GEE) with these IPWs. In this article, we present the xtrccipw command, which can both estimate the IPWs required by RCC and then use these IPWs in a GEE estimator by calling the glm command from within xtrccipw. In the absence of truncation, the xtrccipw command can also be used to run a weighted GEE analysis. We demonstrate the xtrccipw command by analyzing an example dataset and the original Kurland and Heagerty (2005) data. We also use xtrccipw to illustrate some empirical properties of RCC through a simulation study.
Other authorsSee publication -
"Uncertain times call for certain measurements."
Science: NextGen voices: Advocacy in brief
-
Multiple Cause-of-Death Analysis Reveals Under Reporting of Disease Burden for Diabetes
American Heart Association, Inc.
Introduction: Despite being considerably under reported as the underlying cause of death on death certificates, and consequently on mortality figures, diabetes is among the ten leading causes of death in the U.S. A multiple cause-of-death analysis shows the extent to which diabetes is associated with other leading causes of death.
Hypothesis: Analysis of multiple-cause-of-death will confirm prevalence rates of diabetes among racial/ethnic minority populations, demonstrate the impact of…Introduction: Despite being considerably under reported as the underlying cause of death on death certificates, and consequently on mortality figures, diabetes is among the ten leading causes of death in the U.S. A multiple cause-of-death analysis shows the extent to which diabetes is associated with other leading causes of death.
Hypothesis: Analysis of multiple-cause-of-death will confirm prevalence rates of diabetes among racial/ethnic minority populations, demonstrate the impact of diabetes in association with other causes of death, and highlight variations of burden of disease among different racial/ethnic groups.
Methods: Causes of death were identified using the Multiple Cause Mortality Files of the National Center for Health Statistics from 2003 to 2012. Age-adjusted mortality rates were calculated for diabetes both as the underlying cause of death (UCD) and as multiple causes of death (MCD) by racial/ethnic groups (NHWs, Blacks, Asians, and Hispanic/Latinos). Frequencies and proportions were calculated by race/ethnicity groups. Linear regression model was used for number of causes per death.
Results: A total of 2,335,198 decedents had diabetes listed as MCD in the U.S. national death records from 2003-2012. Mortality rates of diabetes as MCD were 3.4 times than UCD for Asians, 2.9 times for Blacks, 2.9 times for Hispanics and 3.7 times for NHWs (Figure). Minority populations had higher proportion of deaths with diabetes reported as MCD than NHWs (1.7 times higher for Hispanics, 1.5 times higher for Blacks and Asians). Adjusting for age, gender, and race/ethnicity, there were 1.7 more causes per death co-occurred for diabetes decedents compared to decedents who died due to all other causes (95% CI: 1.714, 1.718).
Conclusions: Our findings underscore the importance of a multiple-cause-of-death approach in the analyses for a more comprehensive understanding of the impact of diabetes.Other authorsSee publication -
A Bayesian approach to the g-formula
Statistical Methods in Medical Research
Epidemiologists often wish to estimate quantities that are easy to communicate and correspond to the results of realistic public health interventions. Methods from causal inference can answer these questions. We adopt the language of potential outcomes under Rubin’s original Bayesian framework and show that the parametric g-formula is easily amenable to a Bayesian approach. We show that the frequentist properties of the Bayesian g-formula suggest it improves the accuracy of estimates of causal…
Epidemiologists often wish to estimate quantities that are easy to communicate and correspond to the results of realistic public health interventions. Methods from causal inference can answer these questions. We adopt the language of potential outcomes under Rubin’s original Bayesian framework and show that the parametric g-formula is easily amenable to a Bayesian approach. We show that the frequentist properties of the Bayesian g-formula suggest it improves the accuracy of estimates of causal effects in small samples or when data are sparse. We demonstrate an approach to estimate the effect of environmental tobacco smoke on body mass index among children aged 4–9 years who were enrolled in a longitudinal birth cohort in New York, USA. We provide an algorithm and supply SAS and Stan code that can be adopted to implement this computational approach more generally.
Other authorsSee publication -
Likelihood of unemployed smokers vs nonsmokers attaining reemployment in a one-year observational study
JAMA Internal Medicine
Importance: Studies in the United States and Europe have found higher smoking prevalence among unemployed job seekers relative to employed workers. While consistent, the extant epidemiologic investigations of smoking and work status have been cross-sectional, leaving it underdetermined whether tobacco use is a cause or effect of unemployment.
Conclusions and Relevance: To our knowledge, this is the first study to prospectively track reemployment success by smoking status. Smokers had a…Importance: Studies in the United States and Europe have found higher smoking prevalence among unemployed job seekers relative to employed workers. While consistent, the extant epidemiologic investigations of smoking and work status have been cross-sectional, leaving it underdetermined whether tobacco use is a cause or effect of unemployment.
Conclusions and Relevance: To our knowledge, this is the first study to prospectively track reemployment success by smoking status. Smokers had a lower likelihood of reemployment at 1 year and were paid significantly less than nonsmokers when reemployed. Treatment of tobacco use in unemployment service settings is worth testing for increasing reemployment success and financial well-being.Other authorsSee publication -
Longitudinal regression conditioning on continuation
The University of North Carolina at Chapel Hill, ProQuest Dissertations Publishing
Individuals in a longitudinal study may have missing data for multiple reasons, including intermittent
missed visits or permanent study drop out. Additionally, individuals may experience
a truncating event, such as death, past which the outcomes of interest are no longer meaningful.
Kurland and Heagerty (2005) developed a method to conduct regression conditioning on being
alive (RCA), which constructs inverse-probability weights (IPWs) of the dropout probability
among continuing…Individuals in a longitudinal study may have missing data for multiple reasons, including intermittent
missed visits or permanent study drop out. Additionally, individuals may experience
a truncating event, such as death, past which the outcomes of interest are no longer meaningful.
Kurland and Heagerty (2005) developed a method to conduct regression conditioning on being
alive (RCA), which constructs inverse-probability weights (IPWs) of the dropout probability
among continuing individuals used to t generalized estimating equations (GEE). RCA has
since been extended to allow for intermittent missingness (IM) of outcomes (Shardell and Miller,
2008). We further extend these methods to simultaneously accommodate dierent mechanisms
for dropout and IM, and call our method regression conditioning on continuation (RCC). RCC is
illustrated using data from a recent study of mother-to-child transmission of HIV to draw inference
on mean infant weights subject to truncation from infant death or HIV infection.
Currently, there is no widely available software for conducting RCA. We present the xtrccipw
command in Stata, which can estimate the dropout IPWs required by RCC if there is no IM.
These IPWs estimated using xtrccipw are then used as weights in a GEE estimator using the
glm command, completing the RCC method. In the absence of truncation, the xtrccipw and
glm commands can also be used in a weighted GEE analysis. The xtrccipw command is demonstrated
by analyzing two example datasets and the original Kurland and Heagerty (2005) data. -
Integrating group counseling, cell phone messaging, and participant-generated songs and dramas into a microcredit program increases Nigerian women's adherence to international breastfeeding recommendations.
The Journal of Nutrition
In northern Nigeria, interventions are urgently needed to narrow the large gap between international breastfeeding recommendations and actual breastfeeding practices. Studies of integrated microcredit and community health interventions documented success in modifying health behaviors but typically had uncontrolled designs. We conducted a cluster-randomized controlled trial in Bauchi State, Nigeria, with the aim of increasing early breastfeeding initiation and exclusive breastfeeding among…
In northern Nigeria, interventions are urgently needed to narrow the large gap between international breastfeeding recommendations and actual breastfeeding practices. Studies of integrated microcredit and community health interventions documented success in modifying health behaviors but typically had uncontrolled designs. We conducted a cluster-randomized controlled trial in Bauchi State, Nigeria, with the aim of increasing early breastfeeding initiation and exclusive breastfeeding among female microcredit clients. ... In conclusion, a breastfeeding promotion intervention integrated into microcredit increased the likelihood that women adopted recommended breastfeeding practices. This intervention could be scaled up in Nigeria, where local organizations provide microcredit to >500,000 clients. Furthermore, the intervention could be adopted more widely given that >150 million women, many of childbearing age, are involved in microfinance globally. This trial was registered at clinicaltrials.gov as NCT01352351.
Other authorsSee publication -
Plasma and breast-milk selenium in HIV-infected Malawian mothers are positively associated with infant selenium status but are not associated with maternal supplementation: results of the Breastfeeding, Antiretrovirals, and Nutrition study.
The American Journal of Clinical Nutrition
Background: Selenium is found in soils and is essential for human antioxidant defense and immune function. In Malawi, low soil selenium and dietary intakes coupled with low plasma selenium concentrations in HIV infection could have negative consequences for the health of HIV-infected mothers and their exclusively breastfed infants.
Objective: We tested the effects of lipid-based nutrient supplements (LNS) that contained 1.3 times the Recommended Dietary Allowance of sodium selenite and…Background: Selenium is found in soils and is essential for human antioxidant defense and immune function. In Malawi, low soil selenium and dietary intakes coupled with low plasma selenium concentrations in HIV infection could have negative consequences for the health of HIV-infected mothers and their exclusively breastfed infants.
Objective: We tested the effects of lipid-based nutrient supplements (LNS) that contained 1.3 times the Recommended Dietary Allowance of sodium selenite and antiretroviral drugs (ARV) on maternal plasma and breast-milk selenium concentrations.
...
Conclusions: Selenite supplementation of HIV-infected Malawian women was not associated with a change in their plasma or breast-milk selenium concentrations. Future research should examine effects of more readily incorporated forms of selenium (ie, selenomethionine) in HIV-infected breastfeeding women. This trial was registered at clinicaltrials.gov as NCT00164736.Other authorsSee publication -
Changes in soluble transferrin receptor and hemoglobin concentrations in Malawian mothers are associated with those values in their exclusively breastfed, HIV-exposed infants.
The Journal of Nutrition
Infant iron status at birth is influenced by maternal iron status during pregnancy; however, there are limited data on the extent to which maternal iron status is associated with infant iron status during exclusive breastfeeding. We evaluated how maternal and infant hemoglobin and iron status [soluble transferrin receptors (TfR) and ferritin] were related during exclusive breastfeeding in HIV-infected women and their infants. The Breastfeeding, Antiretrovirals, and Nutrition Study was a…
Infant iron status at birth is influenced by maternal iron status during pregnancy; however, there are limited data on the extent to which maternal iron status is associated with infant iron status during exclusive breastfeeding. We evaluated how maternal and infant hemoglobin and iron status [soluble transferrin receptors (TfR) and ferritin] were related during exclusive breastfeeding in HIV-infected women and their infants. The Breastfeeding, Antiretrovirals, and Nutrition Study was a randomized controlled trial in Lilongwe, Malawi, in which HIV-infected women were assigned with a 2 × 3 factorial design to a lipid-based nutrient supplement (LNS), or no LNS, and maternal, infant, or no antiretroviral drug, and followed for 24 wk. Longitudinal models were used to relate postpartum maternal hemoglobin (n = 1926) to concurrently measured infant hemoglobin, adjusting for initial infant hemoglobin values. In a subsample, change in infant iron status (hemoglobin, log ferritin, log TfR) between 2 (n = 352) or 6 wk (n = 167) and 24 wk (n = 519) was regressed on corresponding change in the maternal indicator, adjusting for 2 or 6 wk values. A 1 g/L higher maternal hemoglobin at 12, 18, and 24 wk was associated with a 0.06 g/L (P = 0.01), 0.10 g/L (P < 0.001), and 0.06 g/L (P = 0.01), respectively, higher infant hemoglobin. In the subsample, a reduction in maternal log TfR and an increase in hemoglobin from initial measurement to 24 wk were associated with the same pattern in infant values (log TfR β = −0.18 mg/L, P < 0.001; hemoglobin β = 0.13 g/L, P = 0.01). Given the observed influence of maternal and initial infant values, optimizing maternal iron status in pregnancy and postpartum is important to protect infant iron status. This trial was registered at clinicaltrials.gov as NCT00164736.
Other authorsSee publication
Courses
-
Advanced Econometrics
ECON 870
-
Advanced Linear Models I
BIOS 762
-
Advanced Statistical Methods in Biometric and Public Health
BIOS 758
-
Advanced Survey Sampling Methods
BIOS 764
-
Analysis of Categorical Data
BIOS 665
-
Bayesian Statistics
BIOS 779
-
Causal Inference in Biomedical Research (audited)
BIOS 776
-
Causal Inference in Public Health
MCH 759
-
Data Management in Clinical and Public Health Research
BIOS 613
-
Demographic Techniques I
BIOS 670
-
Design and Analysis of Clinical Trials
BIOS 752
-
Group Lessons in Voice
MUSC 111
-
Health Effects of Environmental Agents
ENVR 430
-
International and Comparative Health Systems
HPM 660
-
Introductory Survivorship Analysis
BIOS 680
-
Longitudinal Data Analysis
BIOS 767
-
Practice in Statistical Consulting
BIOS 842
-
Principles of Epidemiology
EPID 600
-
Principles of Statistical Consulting
BIOS 841
-
Research Ethics
GRAD 721
-
Social and Behavioral Sciences in Public Health
HBHE 600
-
Structural Equations with Latent Variables
SOCI 717
-
Systematic Review and Meta-Analysis
EPID 730
-
Systems Biology in Environmental Health
ENVR 890
-
Training in Statistical Teaching in the Health Sciences
BIOS 850
-
Biostatistical Methods: Survival Analysis and Causality
PB HLTH C240B
-
Censored Longitudinal Data and Causality
PB HLTH C246A
-
Introduction to Analysis
MATH 104
-
Linear Algebra & Differential Equations
MATH 54
-
Multivariable Calculus
MATH 53
-
Computing for Data Science (audited) (http://statweb.stanford.edu/~naras/stat290/Stat290_Website/Stat_290.html)
STATS 290
Projects
-
N-of-5 Minutes
Five minute chats with experts from all corners of science about statistics that can be applied to a single person. All scientists and non-scientists are welcomed! This podcast is hosted by Stats-of-1 (statsof1.org), a blog on a mission to improve personalized health through digital individualized statistical designs or methods.
Other creators -
Stats-of-1
Stats-of-1 is a newsletter that seeks to improve personalized health by promoting the expanded use of individual-focused, quantitative idiography (QI) statistical designs and methods in health and medicine. Collectively, these QI approaches define the field of esametry (derived from “isa”, the Tagalog Filipino word for “one”).
Inspired by the intra-individual or within-individual approaches of n-of-1 trials, single-case experimental designs, and single-subject research, we are building a…Stats-of-1 is a newsletter that seeks to improve personalized health by promoting the expanded use of individual-focused, quantitative idiography (QI) statistical designs and methods in health and medicine. Collectively, these QI approaches define the field of esametry (derived from “isa”, the Tagalog Filipino word for “one”).
Inspired by the intra-individual or within-individual approaches of n-of-1 trials, single-case experimental designs, and single-subject research, we are building a community of statistics-savvy pioneers who use modern tools of digital health (e.g., wearables, sensors) to create or adapt statistical techniques to study a single individual’s recurring trends. Visit our About page to learn more.Other creatorsSee project -
Causes and Associations in Single-Individual Analysis (CASIA) [pronounced: ka-sha]
-
Goal: To establish the feasibility of applying causal inference methods to a single individual's data to improve causal discovery for personalized/precision health. To develop observational and experimental study design and analysis methods where needed.
Other creatorsSee project -
DISCOVeR, Stanford University School of Medicine, Palo Alto Medical Foundation
-
http://med.stanford.edu/discover.html
http://www.pamf.org/discover/causes.htmlOther creatorsSee project -
The Breastfeeding, Antiretrovirals, and Nutrition Study: Malawi Mothers and Infants (BAN: MaMi)
-
http://www.cpc.unc.edu/news/features/improving-health-and-survival-in-malawi
http://hpdp.unc.edu/research/projects/ban/Other creatorsSee project -
A Statistical New World
-
My talented colleagues and friends produced the vision and lyrics for this educational and entertaining music video based on Disney's "A Whole New World" on how statistics educates and empowers. I was fortunate to be able to work with them, and to lend my theatrical, vocal, and music-production expertise to their project.
Other creatorsSee project -
Non-response in Wave IV of the Add Health Study
-
Non-response is a potential threat to the accuracy of estimates obtained from sample surveys and can be particularly difficult to avoid in longitudinal studies. The objective of this report is to investigate non-response and consequent bias in estimates for Wave IV of the National Longitudinal Study of Adolescent Health (Add Health). The Survey Research Unit at the University of North Carolina at Chapel Hill previously analyzed the non-response rates for the first three waves of Add Health. As…
Non-response is a potential threat to the accuracy of estimates obtained from sample surveys and can be particularly difficult to avoid in longitudinal studies. The objective of this report is to investigate non-response and consequent bias in estimates for Wave IV of the National Longitudinal Study of Adolescent Health (Add Health). The Survey Research Unit at the University of North Carolina at Chapel Hill previously analyzed the non-response rates for the first three waves of Add Health. As shown in Chantala, Kalsbeek and Andraca, 2005, the total bias in Waves I, II, and III for 13 measures of health and risk behaviors rarely exceed 1%, which is small relative to the 20% to 80% prevalence rates for most of these measures. Results are similar for Wave IV.
In this paper, first, we outline the Wave IV sampling design and results of the field work. Second, we characterize the non-response rates overall and stratified by a number of demographic variables. Next, we use data on the health risk measures reported by Wave IV responders and non-responders during their Wave I In-home interview to estimate total and relative bias due to non-response in Wave IV. We conclude with a discussion of Wave IV bias due to non-response.Other creatorsSee project
Honors & Awards
-
Young Investigator Award, 2018 Sage Assembly: Algorithms and the Role of the Individual
Sage Bionetworks
-
Improving personalized medicine through n-of-1 causal inference and predictive modeling
Stanford Center for Clinical and Translational Research and Education (Spectrum), Population Health Sciences
Languages
-
Tagalog
Native or bilingual proficiency
-
Spanish
Limited working proficiency
-
English
Native or bilingual proficiency
-
SAS
Full professional proficiency
-
Stata
Full professional proficiency
-
R
Native or bilingual proficiency
-
SQL
Professional working proficiency
-
Filipino
Native or bilingual proficiency
-
Python
Limited working proficiency
Organizations
-
American Statistical Association: Justice, Equity, Diversity and Inclusion Outreach Group
Member, PDC Chair, JEDI Chair-Elect
-
American Statistical Association, San Francisco Bay Area Chapter
member
-
American Statistical Association
member
-
American Statistical Association
-
-
American Statistical Association
-
-
American Statistical Association, North Carolina Chapter
member
-
Risley Residential College for the Performing Arts
tech crew, actor, tech director
Recommendations received
-
LinkedIn User
3 people have recommended Eric J. Join now to view
More activity by Eric J.
A bit of personal news: I will be stepping down as Dean of the Boston University School of Public Health at the end of the year…
Liked by Eric J. Daza, DrPH, MPS
NEVER STAND STILL || Always happy to be back here! Thanks for the gift of dual masters in public health economics and management… And now, for this…
Liked by Eric J. Daza, DrPH, MPS
This post isn't about me and my work - it's about my wife. After an eighteen year hiatus, she's trying to re-enter the workforce in public health…
Liked by Eric J. Daza, DrPH, MPS
🌟 Exciting Opportunity Alert! 🌟 Bescy is growing quickly, and we're seeking a visionary leader like you to join us as our Vice President of…
Liked by Eric J. Daza, DrPH, MPS
QUESTION FOR SOLO ENTREPRENEURS When it comes to running a business, I tend to get stuck in my ways and don't try new tools nearly enough. Most if…
Liked by Eric J. Daza, DrPH, MPS
Boehringer Ingelheim partners with Walgreens in clinical trials for obesity treatment: 🧪The pair have partnered to enhance clinical trial…
Liked by Eric J. Daza, DrPH, MPS
#ICER is debuting a clinical diversity framework…and found no consistent measures of diversity (no surprise)…#askpatients #patientengagement…
Liked by Eric J. Daza, DrPH, MPS
It's official now. As some of you may know, I have been selected as a Fellow of the American Statistical Association. In my case, the following…
Liked by Eric J. Daza, DrPH, MPS
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More