Release Technical Preview · sophgo/tpu-mlir

TPU-MLIR Project Update

Fix Dependency: Fixed the dependency of MLIRInputConversion.
SDK Release Workflow: Fixed tpu-mlir tag for building and added workflow file for SDK release.
Softplus LoweringINT8: Fixed 1684 Softplus LoweringINT8 issue.
Slice Begin Index: Fixed bm1684 slice begin_index problem.
Mul Conflict Resolution: Partially fixed the output data sign of mul conflict with chip restriction.

Subgraph Split Support: Enhanced support for subgraph split.
Quant IO List Note: Added quant io list note for better quantization handling.
New Full Operation: Supported the aten::new_full operation.
Torch Flip for bm1684x: Added support for torch.flip for bm1684x.
Weight Input Shape Bind: Supported shape bind for weight input.

Kernel Module Usage: Reverted to using the old kernel module.
MLIR Conv2D Optimization: Improved 1684 mlir conv2d with 3ic optimization.
SWINT Quantization: Added swint quant for better performance.
Opt Parameter Addition: Added an optimization parameter.
Loop and Fusion Enhancements: Supported interchange of inner loop, padOp transform, tensor op collapse, fusion on linalg-on-tensor, etc.