A simpler LSTM-based approach to use until I get the Pointer-Generator Network to perform well.
This model implements abstractive text summarization: a character-level sequence-to-sequence (seq2seq) model with a biLSTM architecture predicts summaries from source documents.
Character-level Sequence-to-sequence Algorithm:
- Start with input sequences from a domain (e.g. text documents) and corresponding target sequences from another domain (e.g. text summaries).
- An encoder LSTM transforms input sequences into two state vectors. (We keep the last LSTM state and discard the outputs.)
- A decoder LSTM is trained to transform the target sequences into the same sequences offset by one timestep into the future, a training process known as "teacher forcing" in this context. It uses the encoder's state vectors as its initial state. Essentially, the decoder learns to generate 'targets[t + 1...]' given 'targets[...t]', conditioned on the input sequence (see the training sketch after this list).
- In inference mode, to decode unseen input sequences:
- Encode the input sequence into state vectors
- Start with a target sequence of size 1 (just the "start-of-sequence character")
- Feed the state vectors and 1-char target sequence into the decoder to produce predictions of the next character
- Select the next character from these predictions (greedy argmax sampling)
- Append the sampled character to the target sequence
- Repeat until the "end-of-sequence character" is generated or we reach the character limit (a decoding loop along these lines is sketched below).
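
A minimal sketch of the training graph described above, assuming the biLSTM is the encoder and the decoder is a plain LSTM fed with teacher forcing. Vocabulary sizes, 'latent_dim', and one-hot character inputs are illustrative assumptions, not settings taken from this repository:

```python
from tensorflow.keras.layers import Input, LSTM, Bidirectional, Dense, Concatenate
from tensorflow.keras.models import Model

num_encoder_tokens = 80   # size of the document character vocabulary (assumed)
num_decoder_tokens = 60   # size of the summary character vocabulary (assumed)
latent_dim = 256          # per-direction encoder width (assumed)

# Encoder: a biLSTM; we keep only its final states and discard the outputs.
encoder_inputs = Input(shape=(None, num_encoder_tokens))
encoder = Bidirectional(LSTM(latent_dim, return_state=True))
_, fwd_h, fwd_c, bwd_h, bwd_c = encoder(encoder_inputs)
state_h = Concatenate()([fwd_h, bwd_h])   # concatenated hidden states
state_c = Concatenate()([fwd_c, bwd_c])   # concatenated cell states
encoder_states = [state_h, state_c]

# Decoder: an LSTM initialised with the encoder states. During training it
# receives the target summary as input and must predict the same summary
# shifted one character into the future (teacher forcing).
decoder_inputs = Input(shape=(None, num_decoder_tokens))
decoder_lstm = LSTM(latent_dim * 2, return_sequences=True, return_state=True)
decoder_outputs, _, _ = decoder_lstm(decoder_inputs, initial_state=encoder_states)
decoder_outputs = Dense(num_decoder_tokens, activation="softmax")(decoder_outputs)

model = Model([encoder_inputs, decoder_inputs], decoder_outputs)
model.compile(optimizer="rmsprop", loss="categorical_crossentropy")
```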
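And a sketch of the greedy decoding loop used at inference time. It assumes separate inference-mode encoder/decoder models rebuilt from the trained layers (as in the standard Keras character-level seq2seq example), a 'target_token_index' / 'reverse_target_token_index' mapping, and '\t' / '\n' as the start- and end-of-sequence characters; all of these names are illustrative:

```python
import numpy as np

def decode_sequence(input_seq, encoder_model, decoder_model,
                    target_token_index, reverse_target_token_index,
                    num_decoder_tokens, max_summary_len=400):
    # 1. Encode the input document into the initial state vectors.
    states_value = encoder_model.predict(input_seq, verbose=0)

    # 2. Start with a target sequence of length 1: the start-of-sequence char.
    target_seq = np.zeros((1, 1, num_decoder_tokens))
    target_seq[0, 0, target_token_index["\t"]] = 1.0

    decoded = []
    while True:
        # 3. Predict a distribution over the next character.
        output_tokens, h, c = decoder_model.predict(
            [target_seq] + states_value, verbose=0)

        # 4. Greedy sampling: take the argmax of the distribution.
        sampled_index = int(np.argmax(output_tokens[0, -1, :]))
        sampled_char = reverse_target_token_index[sampled_index]

        # 5. Stop on the end-of-sequence character or at the length limit.
        if sampled_char == "\n" or len(decoded) >= max_summary_len:
            break
        decoded.append(sampled_char)

        # 6. Feed the sampled character back in as a length-1 one-hot target
        #    and carry the decoder states forward.
        target_seq = np.zeros((1, 1, num_decoder_tokens))
        target_seq[0, 0, sampled_index] = 1.0
        states_value = [h, c]

    return "".join(decoded)
```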
Relevant datasets:
- English to French sentence pairs
- Many sentence-pair datasets can be found at http://www.manythings.org/anki/ (a short loading sketch follows).
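
A hedged sketch of loading one of these tab-separated pairs files (the Anki files use one 'input<TAB>target' pair per line). The filename 'fra.txt' and the '\t' / '\n' start/end markers are assumptions, not repository settings:

```python
def load_pairs(path, num_samples=10000):
    input_texts, target_texts = [], []
    with open(path, encoding="utf-8") as f:
        for line in f.read().split("\n")[:num_samples]:
            if not line or "\t" not in line:
                continue
            source, target = line.split("\t")[:2]
            # '\t' marks start-of-sequence, '\n' marks end-of-sequence.
            input_texts.append(source)
            target_texts.append("\t" + target + "\n")
    return input_texts, target_texts

input_texts, target_texts = load_pairs("fra.txt")
input_chars = sorted(set("".join(input_texts)))    # encoder character vocabulary
target_chars = sorted(set("".join(target_texts)))  # decoder character vocabulary
```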
References: