multi-class-text-classification-cnn-rnn

Classify Kaggle San Francisco Crime Description into 39 classes. Build the model with CNN, RNN (GRU and LSTM) and Word Embeddings on Tensorflow.

Github星跟踪图

Project: Classify Kaggle San Francisco Crime Description

Highlights:

  • This is a multi-class text classification (sentence classification) problem.
  • The goal of this project is to classify Kaggle San Francisco Crime Description into 39 classes.
  • This model was built with CNN, RNN (LSTM and GRU) and Word Embeddings on Tensorflow.

Data: Kaggle San Francisco Crime

  • Input: Descript

  • Output: Category

  • Examples:

    Descript, Category
    -----------, -----------
    GRAND THEFT FROM LOCKED AUTO, LARCENY/THEFT
    POSSESSION OF NARCOTICS PARAPHERNALIA, DRUG/NARCOTIC
    AIDED CASE, MENTAL DISTURBED, NON-CRIMINAL
    AGGRAVATED ASSAULT WITH BODILY FORCE, ASSAULT
    ATTEMPTED ROBBERY ON THE STREET WITH A GUN, ROBBERY

Train:

  • Command: python3 train.py train_data.file train_parameters.json
  • Example: python3 train.py ./data/train.csv.zip ./training_config.json

Predict:

  • Command: python3 predict.py ./trained_results_dir/ new_data.csv
  • Example: python3 predict.py ./trained_results_1478563595/ ./data/small_samples.csv

Reference:

主要指标

概览
名称与所有者jiegzhan/multi-class-text-classification-cnn-rnn
主编程语言Python
编程语言Python (语言数: 1)
平台
许可证Apache License 2.0
所有者活动
创建于2016-10-28 16:55:06
推送于2018-03-23 17:46:57
最后一次提交2018-03-23 10:46:56
发布数0
用户参与
星数599
关注者数52
派生数263
提交数79
已启用问题?
问题数38
打开的问题数30
拉请求数3
打开的拉请求数1
关闭的拉请求数0
项目设置
已启用Wiki?
已存档?
是复刻?
已锁定?
是镜像?
是私有?