multi-class-text-classification-cnn-rnn

Classify Kaggle San Francisco Crime Description into 39 classes. Build the model with CNN, RNN (GRU and LSTM) and Word Embeddings on Tensorflow.

Github星跟蹤圖

Project: Classify Kaggle San Francisco Crime Description

Highlights:

  • This is a multi-class text classification (sentence classification) problem.
  • The goal of this project is to classify Kaggle San Francisco Crime Description into 39 classes.
  • This model was built with CNN, RNN (LSTM and GRU) and Word Embeddings on Tensorflow.

Data: Kaggle San Francisco Crime

  • Input: Descript

  • Output: Category

  • Examples:

    Descript, Category
    -----------, -----------
    GRAND THEFT FROM LOCKED AUTO, LARCENY/THEFT
    POSSESSION OF NARCOTICS PARAPHERNALIA, DRUG/NARCOTIC
    AIDED CASE, MENTAL DISTURBED, NON-CRIMINAL
    AGGRAVATED ASSAULT WITH BODILY FORCE, ASSAULT
    ATTEMPTED ROBBERY ON THE STREET WITH A GUN, ROBBERY

Train:

  • Command: python3 train.py train_data.file train_parameters.json
  • Example: python3 train.py ./data/train.csv.zip ./training_config.json

Predict:

  • Command: python3 predict.py ./trained_results_dir/ new_data.csv
  • Example: python3 predict.py ./trained_results_1478563595/ ./data/small_samples.csv

Reference:

主要指標

概覽
名稱與所有者jiegzhan/multi-class-text-classification-cnn-rnn
主編程語言Python
編程語言Python (語言數: 1)
平台
許可證Apache License 2.0
所有者活动
創建於2016-10-28 16:55:06
推送於2018-03-23 17:46:57
最后一次提交2018-03-23 10:46:56
發布數0
用户参与
星數599
關注者數52
派生數263
提交數79
已啟用問題?
問題數38
打開的問題數30
拉請求數3
打開的拉請求數1
關閉的拉請求數0
项目设置
已啟用Wiki?
已存檔?
是復刻?
已鎖定?
是鏡像?
是私有?