Tencent2019_Preliminary_Rank1st

The code for 2019 Tencent College Algorithm Contest, and the online result ranks 1st in the preliminary.

  • Owner: guoday/Tencent2019_Preliminary_Rank1st
  • Platform:
  • License::
  • Category::
  • Topic:
  • Like:
    0
      Compare:

Github stars Tracking Chart

1. 题目介绍

请直接查看guide.pdf了解赛题,该项目是初赛第一名的模型。

2.模型介绍

avatar

avatar

avatar

3. 配置环境

  • scikit-learn
  • tqdm
  • pandas
  • numpy
  • scipy
  • tensorFlow=1.12.0 (其他版本≥1.4且不等于1.5或1.6)
  • Linux Ubuntu 16.04, 128G内存(64G应该足够),一张显卡

4.数据下载

mkdir data 
cd data
#Download data from https://pan.baidu.com/s/1ASQMms_u70psRgW_KEyT2Q 
#Password: burw
unzip algo.qq.com_641013010_testa.zip imps_log.zip user.zip
cd ..

5.数据预处理

python src/preprocess.py

6.提取特征

python src/extract_feature.py

7.转换数据格式

python src/convert_format.py

1)缺失值NA用0填充

2)将Word2Vec和DeepWalk得到的embedding拼接起来,并且掩盖到5%的广告

3)将需要用key-values的稠密特征正则化到[0,1]之间

8.训练模型

mkdir submission
python train.py

Main metrics

Overview
Name With Ownerguoday/Tencent2019_Preliminary_Rank1st
Primary LanguagePython
Program languagePython (Language Count: 1)
Platform
License:
所有者活动
Created At2019-07-10 14:56:15
Pushed At2021-04-02 10:57:14
Last Commit At2020-10-08 22:38:41
Release Count0
用户参与
Stargazers Count708
Watchers Count17
Fork Count278
Commits Count16
Has Issues Enabled
Issues Count4
Issue Open Count0
Pull Requests Count0
Pull Requests Open Count0
Pull Requests Close Count2
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private