MCPcopy
hub / github.com/dennybritz/cnn-text-classification-tf

github.com/dennybritz/cnn-text-classification-tf @main sqlite

repository ↗ · DeepWiki ↗
10 symbols 34 edges 4 files 6 documented · 60%
README

This code belongs to the "Implementing a CNN for Text Classification in Tensorflow" blog post.

It is slightly simplified implementation of Kim's Convolutional Neural Networks for Sentence Classification paper in Tensorflow.

Requirements

  • Python 3
  • Tensorflow > 0.12
  • Numpy

Training

Print parameters:

./train.py --help
optional arguments:
  -h, --help            show this help message and exit
  --embedding_dim EMBEDDING_DIM
                        Dimensionality of character embedding (default: 128)
  --filter_sizes FILTER_SIZES
                        Comma-separated filter sizes (default: '3,4,5')
  --num_filters NUM_FILTERS
                        Number of filters per filter size (default: 128)
  --l2_reg_lambda L2_REG_LAMBDA
                        L2 regularizaion lambda (default: 0.0)
  --dropout_keep_prob DROPOUT_KEEP_PROB
                        Dropout keep probability (default: 0.5)
  --batch_size BATCH_SIZE
                        Batch Size (default: 64)
  --num_epochs NUM_EPOCHS
                        Number of training epochs (default: 100)
  --evaluate_every EVALUATE_EVERY
                        Evaluate model on dev set after this many steps
                        (default: 100)
  --checkpoint_every CHECKPOINT_EVERY
                        Save model after this many steps (default: 100)
  --allow_soft_placement ALLOW_SOFT_PLACEMENT
                        Allow device soft device placement
  --noallow_soft_placement
  --log_device_placement LOG_DEVICE_PLACEMENT
                        Log placement of ops on devices
  --nolog_device_placement

Train:

./train.py

Evaluating

./eval.py --eval_train --checkpoint_dir="./runs/1459637919/checkpoints/"

Replace the checkpoint dir with the output from the training. To use your own data, change the eval.py script to load your data.

References

Core symbols most depended-on inside this repo

preprocess
called by 1
train.py
train
called by 1
train.py
train_step
called by 1
train.py
dev_step
called by 1
train.py
clean_str
called by 1
data_helpers.py
main
called by 0
train.py
load_data_and_labels
called by 0
data_helpers.py
batch_iter
called by 0
data_helpers.py

Shape

Function 8
Class 1
Method 1

Languages

Python100%

Modules by API surface

train.py5 symbols
data_helpers.py3 symbols
text_cnn.py2 symbols

For agents

$ claude mcp add cnn-text-classification-tf \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact