-
Combining Raw DNA files from 23andMe & AncestryDNA
Recently I’ve been very interested in admixture/genealogy, though sadly results for East Asian heritage lack detail and usefulness if you test with a Western company. WeGene results are kinda sketchy in terms of QA in my opinion, and their standards for calculating admixture are somewhat questionable, especially for imported data. Still, they are quite interesting, and the somewhat active community provides some insight into how admixture proportions differ between provinces. …
-
ARIMA Models for Time Series Analysis
I know the forecast library has the auto.arima function, but we had a homework assignment where we were supposed to manually find candidate ARIMA models for unemployment rate prediction by examining the series’ ACF & PACF plots. …
-
P-value elimination in R
Update: Ok… I know p-value elimination for model-building is deeply flawed per https://robjhyndman.com/hyndsight/crossvalidation/, though this nonetheless helped me save a lot of time during my midterms~ …
-
Notes on Module 3
Today I went through Module 3. This time the lab exercises were easy, but I had a hard time understanding parallel coordinates. …
-
Notes on Module 2 of Python for Data Science edX Course
Ok… I got stuck on the Lab 3 questions for Module 2 of the “DAT210x Programming with Python for Data Science” course. In particular, I got stuck at Q2 around July 15 and didn’t go any further since I was going to LA on July 19 to receive my MBA degree from USC. …
-
Set Up My New Site!
After flailing around with my dev environment (uninstalling/deleting/reinstalling Homebrew, reinstalling Ruby with rbenv, etc.), I finally have my Jekyll site ready. Since I’m focusing on learning Python for data analytics, I should keep myself from getting distracted by learning CSS & JavaScript. Hence, I used the Jekyll-Uno theme, as it fits my taste the most for the time being~ …