*import RIVM data en maak variabele Prevalentie per dag per gemeente import delimited https://data.rivm.nl/covid-19/COVID-19_aantallen_gemeente_per_dag.csv rename municipality_code gmcode1 gen date = date(date_of_publication,"YMD") format %td date drop date_of_publication merge m:m gmcode1 using "I:\ONDERZOEK\PROJECTEN\COVIDRADAR\Data\COVIDradarData\InwonersaantalBevolkingsdichtheidGM.dta" keep if _merge==3 drop _merge by gmcode1 date, sort: egen cases=sum(total_reported) by gmcode1 date: gen n=_n keep if n==1 gen incidentiecapita=cases/inwonersgemeente ssc instal rangestat rangestat (sum) PrevalentieCovGM= incidentiecapita, interval(date -3 5) by(gmcode1) save I:\ONDERZOEK\PROJECTEN\COVIDRADAR\Data\COVIDradarData\DataRIVM\RIVMdataDagGemeenteNEWSPORT, replace clear *START WITH COVID RADAR DATA use "I:\ONDERZOEK\PROJECTEN\COVIDRADAR\Data\COVIDradarData\CoViDradarDataNEWwithoutExcludesCleaned.dta" *Drop prior januari 2021 keep if date>22274 *select Transmision period before positive test (realnewdif) or negative test (realnewnegadif) sort newid date gen periode=0 replace periode=1 if realnewdif>-21 & realnewdif<-9 | realnewnegadif>-21 & realnewnegadif<-9 keep if periode==1 *selecteer relevante variabelen keep newid sportnew realnewnegadif realnewdif agenum age gendernum RLBRMTR ooitpos vaccin gmcode1 date nabij *Drop observations of tested users within 60 days gen positivedate=date-realnewdif format %td positivedate gen negativedate=date-realnewnegadif format %td negativedate gen dif=negativedate - positivedate drop if dif>-61 & dif<61 *drop if realnewdif>-21 & realnewdif<-9 & realnewnegadif>-21 & realnewnegadif<-9 *Gen outcome variable gen positive=0 replace positive=1 if realnewdif>-21 & realnewdif<-9 *Gen eventvariable (in case one individual has both positive or negative testresult) drop idn sort newid positive by newid positive: gen idn=_n`' by newid positive: gen event= 1 if _n==1 replace event= sum(event) sort event *drop if no sport information drop if mi(sportnew) *gen vaccinationstatus replace vaccin=1 if vaccin==2 replace vaccin=0 if mi(vaccin) by event: egen vacstat=min(vaccin) replace vacstat=0 if mi(vacstat) *gen maxsport1 = ever not sported *gen maxsport2 = ever inside sported *gen maxsport3 = ever outside sported *gen maxsport4 = ever other sported *gen sumsport1 = number of days not sported *gen sumsport2 = number of days inside sported *gen sumsport3 = number of days outside sported *gen sumsport4 = number of days other sported tab sportnew, gen(sport) foreach v of varlist sport1-sport4{ by event: egen max`v'=max(`v') by event: egen sum`v'=sum(`v') } *drop if never sported drop if maxsport1==1 & maxsport2==0 & maxsport3==0 & maxsport4==0 *import data van RIVM en maak gemiddelde prevalentie per event merge m:1 gmcode1 date using "I:\ONDERZOEK\PROJECTEN\COVIDRADAR\Data\COVIDradarData\DataRIVM\RIVMdataDagGemeenteNEWSPORT" keep if _merge==3 *gen gemiddelde prevalentie per tijdvak (PrevalentieCovGM is som van incidentie per gemeente van 3 dagen terug tot en met 5 dagen in de toekomst) sort event gen prevalentieperhd=PrevalentieCovGM*100000 by event: egen meanprevalentie=mean(prevalentieperhd) *middel gedrag per event by event: egen meannabij=mean(nabij) *drop all users with inprecise information about place of sport drop if maxsport4==1 *tel aantal observaties per event drop idn by event: gen idn=_n by event: gen obs=_N *Maak leeftijdsklasses gen leef=0 if age=="0-5" | age=="06-11" | age=="12-18" replace leef=1 if age=="40-49" | age=="30-39" | age=="19-29" replace leef=2 if age=="60-69" | age=="50-59" replace leef=3 if age=="70-79" | age=="80+" *Maak niet binaire gender vrouw en man=1 replace gendernum=3 if gendernum==2 | gendernum==1 replace gendernum=1 if gendernum==0 replace gendernum=0 if gendernum==3 * leeftijdsverdeling binnensporten vs buitensporten tab leef maxsport2 if idn==1, row chi * vaccinatiestatus binnensporten vs buitensporten tab vacstat maxsport2 if idn==1, row chi * gender binnensporten vs buitensporten tab gendernum maxsport2 if idn==1, row chi *mean obs by binnensporten ttest obs if idn==1, by(maxsport2) *mean Leefbaromter by binnensporten ttest RLBRMTR if idn==1, by(maxsport2) *mean gedrag en prevalentie by binnensporten ttest meannabij if idn==1, by(maxsport2) ttest meanprevalentie if idn==1, by(maxsport2) * testuitslag binnensporten tab maxsport2 positive if idn==1, row * bereken OR binnensporten (univariaat en multivariaat) ssc instal gllamm xi: gllamm positive maxsport2 if idn==1,i(newid) fam(binom) link(logit) robust nip(12) adapt eform xi: gllamm positive maxsport2 i.leef RLBRMTR meannabij i.gendernum vacstat meanprevalentie if idn==1,i(newid) fam(binom) link(logit) robust nip(12) adapt eform * subanalyse als nooit buiten sporten gen alwaysinside=0 if maxsport3==1 & maxsport2==0 replace alwaysinside=1 if maxsport2==1 & maxsport3==0 tab alwaysinside positive if idn==1, row xi: gllamm positive alwaysinside if idn==1,i(newid) fam(binom) link(logit) robust nip(12) adapt eform xi: gllamm positive alwaysinside i.leef RLBRMTR meannabij i.gendernum vacstat meanprevalentie if idn==1,i(newid) fam(binom) link(logit) robust nip(12) adapt eform *subanalyse aantal keer buitensporten tab sumsport2 positive if idn==1, row xi: gllamm positive sumsport2 if idn==1,i(newid) fam(binom) link(logit) robust nip(12) adapt eform xi: gllamm positive sumsport2 i.leef RLBRMTR meannabij i.gendernum vacstat meanprevalentie if idn==1,i(newid) fam(binom) link(logit) robust nip(12) adapt eform save "I:\ONDERZOEK\PROJECTEN\COVIDRADAR\Data\COVIDradarData\Sporten.dta", replace *gen number of weekly sporters use "I:\ONDERZOEK\PROJECTEN\COVIDRADAR\Data\COVIDradarData\CoViDradarDataNEWwithoutExcludesCleaned.dta" sort week newid tab sportnew, gen(sport) foreach v of varlist sport1-sport4{ by week newid: egen max`v'=max(`v') } by week newid: gen n=_n keep if n==1 gen nosport=0 replace nosport=1 if maxsport1==1 & maxsport2==0 & maxsport3==0 & maxsport4==0 by week: egen meannosport=mean(nosport) by week: gen wkn=_n twoway(line meannosport week if wkn==1) codebook meannosport if wkn==1 & !mi(sportnew)