leetgpu-challenges

该项目提供了一系列类似 LeetCode 风格的 GPU 编程练习题，内含参考答案、测试用例和多种 GPU 编程框架的模板代码。

This project provides a series of GPU programming exercises in the style of LeetCode, including answ

GPU 编程实战挑战

这是一份专为高性能计算（HPC）初学者准备的 CUDA 教程与题库，包含 200 个 CUDA 实现的算子、学习笔记以及手搓性能对标官方的 HGEMM、Flas

This is a CUDA tutorial and question bank prepared specifically for beginners in high-performance co

该项目是由 NVIDIA 提供的 Python 库，旨在将 CUDA 的高性能计算能力与 Python 的高效开发体验相结合。它由多个组件构成，包括 cuda.

This Python library, provided by NVIDIA, aims to integrate the high-performance computing capabiliti

这是一款专为 Hopper 架构 GPU 设计的高效 MLA 解码内核，旨在提升大规模语言模型（LLM）的推理效率。它采用 C++ 和 CUDA 开发，通过 N

This is an efficient MLA decoding kernel designed specifically for Hopper architecture GPUs, aiming 

这是一个高效易用的大型语言模型推理引擎，专为解决推理速度慢、资源利用率低等问题而设计。它基于 PyTorch 和 CUDA，并结合内存优化算法（PagedAtt

This is a highly efficient and user-friendly large language model inference engine, specifically des

该项目是基于 CUDA 的 2D 粒子引擎构建的人工生命模拟工具。它提供了图形化用户界面和粒子编辑器，能够轻松模拟软体、流体、数字生物体、遗传和进化等过程。生物

This project is an artificial life simulation tool based on a 2D particle engine built with CUDA. It

这是一个利用 GPU 加速数值计算的 Python 库，与 NumPy 和 SciPy 兼容。你可以轻松地将现有的 NumPy/SciPy 代码，迁移到 NVI

This is a Python library that utilizes GPU acceleration for numerical computing, which is compatible

该项目提供了 14 道题，帮助学习 GPU 编程。你需要编写代码来解决这些问题。尽管代码看起来像 Python，但实际上是使用 numba 库编写 CUDA 代

This project offers 14 exercises to help learn GPU programming. You are required to write code to so