Melvin's digital garden

Team Data Ninja (NUS) at Data Science Game 2016

[2016-11-14 Mon 19:00:00] event: DataScienceSG

Qualifier is an image classification challenge held on Kaggle

  • orientiation of roof affects solar production
    • north-south
    • east-west
    • flat
    • others
  • determine orientation of roof from satellite images
  • opensolarmap.org
  • evaluation metric: accuracy

Approaching the qualifier problem

  • augment the training dataset
    • rotate north-south into east-west, vice-versa
    • slight rotations, shearing, zoom
  • XGBoost got 56% accuracy
  • various convolution neural net
    • ResNet produced the highest accuracy
  • ensemble by allowing other weaker models to overturn the result from ResNet
  • if the majority class is different from ResNet

Top 20 teams will qualify for the final round in Paris.

Finals is a quote conversion challenge from AXA

  • user gets quote for insurance from various brokers
  • evaluation metric: log-loss
  • feature engineering
    • user more likely to convert on anniversay
    • topic modelling with LDA
  • decided on XGBoost

Links to this note