
AirQo Ugandan Air Quality Forecast Challenge

Helping Uganda
$5 000 USD
Completed (almost 6 years ago)
Forecast
902 joined
327 active
Start: Mar 14, 20
Close: May 31, 20
Reveal: May 31, 20
PCA
Data · 24 May 2020, 12:05 · 2

Since the number of features is large (121 x 6, plus a few extra features), I thought I could reduce the dimensionality with PCA. But I see that we can't perform PCA on data containing NaN values. Any suggestions on how I can go forward without having to impute the data?

Discussion 2 answers

I think you cannot perform PCA if you have NaNs, since it is basically a matrix factorization. One way or another, you'll have to impute the missing values with some strategy.
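For what it's worth, here is a minimal sketch of the impute-then-PCA idea using scikit-learn. The data is a random stand-in, not the competition data; the row count, the 10% missing rate, the median strategy, and the 20 components are all assumptions picked for illustration.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.impute import SimpleImputer
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the feature matrix (assumption: 200 rows, 121 x 6 columns).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 121 * 6))
X[rng.random(X.shape) < 0.1] = np.nan  # set roughly 10% of the entries to NaN

reducer = make_pipeline(
    SimpleImputer(strategy="median"),  # fill each NaN with its column median
    StandardScaler(),                  # PCA is scale-sensitive, so standardise first
    PCA(n_components=20),              # 20 is an arbitrary choice for illustration
)

X_reduced = reducer.fit_transform(X)
print(X_reduced.shape)  # (200, 20)
```

Putting the imputer inside the pipeline means the medians are learned on the training rows only and then reused when you call `reducer.transform` on the test set.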

24 May 2020, 12:26
Upvotes 0

Yeah, makes sense, thanks pednt!