Skip to content

Commit

Permalink
Dockerfile.ubi: remove leftover flash-attn references
Browse files Browse the repository at this point in the history
  • Loading branch information
dtrifiro committed May 21, 2024
1 parent bd23984 commit 3be261c
Showing 1 changed file with 0 additions and 4 deletions.
4 changes: 0 additions & 4 deletions Dockerfile.ubi
Original file line number Diff line number Diff line change
Expand Up @@ -240,10 +240,6 @@ RUN pip install \
mv /root/.config/vllm/nccl/cu12/libnccl.so.2.18.1 /opt/vllm/lib/ && \
chmod 0755 /opt/vllm/lib/libnccl.so.2.18.1

# Install flash attention (from pre-built wheel)
RUN --mount=type=bind,from=flash-attn-builder,src=/usr/src/flash-attention-v2,target=/usr/src/flash-attention-v2 \
pip install /usr/src/flash-attention-v2/*.whl --no-cache-dir

RUN --mount=type=cache,target=/root/.cache/pip \
pip install \
# additional dependencies for the TGIS gRPC server
Expand Down

0 comments on commit 3be261c

Please sign in to comment.