[doc] Update Readme.md and NPU-Support.md#234
Conversation
Add self-contained causal_conv1d kernel module (no mindspeed_ops dependency) with full Triton forward/backward implementations adapted from MindSpeed-Ops. Patch monkey_patch_npu to bind npu_causal_conv1d_fn on NPU-patched modules, remove torch fallback in linear_attention_sp, and add NPU-aware causal_conv1d wrapper in gdn_padding_free (no transpose needed, [B,T,D] native format).
Update NPU-Support.md
There was a problem hiding this comment.
Code Review
This pull request updates the documentation (READMEs and NPU support guides in both English and Chinese) to detail Qwen3.5/3.6 FLA compatibility, Triton-Ascend and MindSpeed version requirements, and NPU patch environment variables. The review feedback highlights a version inconsistency in the English guide (listing MindSpeed as 26.0.0_core_r0.12.1 instead of 2.3.0_core_r0.12.1) and requests clarification on whether the lack of FP8 support on Ascend A3 is a hardware or software-level limitation.
Important
The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.
PR type