Skip to content
View htprofessor's full-sized avatar

Block or report htprofessor

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.

C++ 297 50 Updated Mar 15, 2022

An MLIR-based toolchain for AMD AI Engine-enabled devices.

MLIR 284 82 Updated Sep 19, 2024

This package includes the implementation for Sparse-Triangular-Solve (SpTRSV) for Single-node Multi-GPU (scale-up) platforms such as NVIDIA DGX-1 and DGX-2. It is duplicated from https://github.com…

C 5 1 Updated Jun 5, 2020

An auto-tuning framework to accelerate Sparse Triangular Solve on GPU

Cuda 2 Updated May 9, 2023

[FPGA 2024] Source code and bitstream for LevelST: Stream-based Accelerator for Sparse Triangular Solver

Tcl 9 1 Updated Jan 5, 2024

Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators

C++ 81 26 Updated Aug 16, 2024
C++ 7 Updated Jan 16, 2024

CHARM: Composing Heterogeneous Accelerators on Versal ACAP Architecture

C++ 119 17 Updated Aug 12, 2024

Examples utilizing cuSolver and cuSolverMg

Cuda 3 Updated Jun 23, 2021

A sample code for sparse cholesky solver with cuSPARSE and cuSOLVER library

C++ 18 2 Updated Dec 19, 2019

Systolic array implementations for Cholesky, LU, and QR decomposition

C++ 38 6 Updated Jun 28, 2019

🌱🚀分享基于Servlet、SSH、SSM、SpringBoot、SpringCloud等流行技术实现的JavaWeb项目,难度分为5个等级,帮助小白入门JavaWeb开发,协助JavaWeb开发者熟悉最新技术

1,191 156 Updated Jun 25, 2024