Fast parallel RNN-transducer for PyTorch

This project supports PyTorch 1.1 and CUDA 10.0. This repository is fork version of https://github.com/HawkAaron/warp-transducer/tree/master/pytorch_binding.

Installation

Install the latest PyTorch.

WARP_RNNT_PATH should be set to the location of a built WarpRNNT (i.e. libwarprnnt.so). This defaults to ../build, so from within a new warp-transducer clone you could build WarpRNNT like this:

git clone https://github.com/edwardyoon/warp-transducer
cd warp-transducer
mkdir build; cd build
cmake ..
make

Otherwise, set WARP_RNNT_PATH to wherever you have libwarprnnt.so installed. If you have a GPU, you should also make sure that CUDA_HOME is set to the home cuda directory (i.e. where include/cuda.h and lib/libcudart.so live). For example:

export CUDA_HOME="/usr/local/cuda"

Now install the bindings: (Please make sure the GCC version >= 4.9)

cd ../pytorch_binding
python setup.py install

If you try the above and get a dlopen error on OSX with anaconda3 (as recommended by pytorch):

cd ../pytorch_binding
python setup.py install
cd ../build
cp libwarprnnt.dylib /Users/$WHOAMI/anaconda3/lib

This will resolve the library not loaded error. This can be easily modified to work with other python installs if needed.

Example to use the bindings below. The expected cost is 4.495666.

import torch
from warprnnt_pytorch import RNNTLoss
rnnt_loss = RNNTLoss()
acts = torch.FloatTensor([[[[0.1, 0.6, 0.1, 0.1, 0.1],
                             [0.1, 0.1, 0.6, 0.1, 0.1],
                             [0.1, 0.1, 0.2, 0.8, 0.1]],
                             [[0.1, 0.6, 0.1, 0.1, 0.1],
                             [0.1, 0.1, 0.2, 0.1, 0.1],
                             [0.7, 0.1, 0.2, 0.1, 0.1]]]])
labels = torch.IntTensor([[1, 2]])
act_length = torch.IntTensor([2])
label_length = torch.IntTensor([2])
 
acts = torch.nn.functional.log_softmax(torch.autograd.Variable(acts), dim=3).data
acts = torch.autograd.Variable(acts, requires_grad=True)

# if you use CUDA
acts.cuda()

labels = torch.autograd.Variable(labels)
act_length = torch.autograd.Variable(act_length)
label_length = torch.autograd.Variable(label_length)
loss = rnnt_loss(acts, labels, act_length, label_length)
loss.backward()
print(loss)

Documentation

RNNTLoss(size_average=True, blank_label=0):
    """
    size_average (bool): normalize the loss by the batch size (default: True)
    blank_label (int): blank label index
    """

forward(acts, labels, act_lens, label_lens):
    """
    acts: Tensor of (batch x seqLength x labelLength x outputDim) containing output from network
    labels: 2 dimensional Tensor containing all the targets of the batch with zero padded
    act_lens: Tensor of size (batch) containing size of each output sequence from the network
    label_lens: Tensor of (batch) containing label length of each example
    """

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
docs		docs
include		include
pytorch_binding		pytorch_binding
src		src
tests		tests
CMakeLists.txt		CMakeLists.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fast parallel RNN-transducer for PyTorch

Installation

Documentation

About

Releases

Packages

Languages

edwardyoon/warp-transducer

Folders and files

Latest commit

History

Repository files navigation

Fast parallel RNN-transducer for PyTorch

Installation

Documentation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages