
Company News
Socket Named Top Sales Organization by RepVue
Socket won two 2026 Reppy Awards from RepVue, ranking in the top 5% of all sales orgs. AE Alexandra Lister shares what it's like to grow a sales career here.
@infatoshi/gpu-mcp-server
Advanced tools
Pure Node.js Model Context Protocol server for GPU CUDA compilation via Modal. No Python dependency - just install and run CUDA code on Modal GPUs.
Pure Node.js MCP server for compiling and running CUDA code on Modal's cloud GPUs. No Python dependencies - just install via NPM and run CUDA code on B200/H100 GPUs directly from Cursor or any MCP-compatible editor.
npm install -g @infatoshi/gpu-mcp-serverpip install modal
modal token new
npm install -g @infatoshi/gpu-mcp-server
Add to your ~/.cursor/mcp.json or VS Code MCP config:
{
"mcpServers": {
"gpu-cuda": {
"command": "gpu-mcp",
"args": []
}
}
}
That's it! No environment variables needed - the server uses smart defaults.
In Cursor, just ask the AI to compile your CUDA code:
"Compile and run vector_add.cu"
The MCP tool only requires one parameter:
files_glob: Pattern to match CUDA files (e.g., "*.cu", "src/**/*.cu")Optional parameters for fine-tuned control:
{
"files_glob": "kernels/*.cu",
"project_root": "/path/to/project", // Auto-detected if omitted
"entry": "main.cu", // Auto-detected if omitted
"arch": "sm_90", // Auto-detected from MODAL_GPU if omitted
"extra_flags": ["-O3", "--use_fast_math"],
"run_args": ["1000000"]
}
# GPU type (default: B200)
export MODAL_GPU=B200 # or H100, H200, A100
# Modal app name (default: cuda-kernel-runner)
export MODAL_APP=my-cuda-app
| GPU Type | Architecture | Compute Capability |
|---|---|---|
| B200 (default) | sm_100 | 10.0 |
| H100/H200 | sm_90 | 9.0 |
| A100 | sm_80 | 8.0 |
Create vector_add.cu:
#include <stdio.h>
__global__ void vectorAdd(float *a, float *b, float *c, int n) {
int i = blockDim.x * blockIdx.x + threadIdx.x;
if (i < n) c[i] = a[i] + b[i];
}
int main() {
const int N = 1000000;
// ... CUDA setup code ...
vectorAdd<<<(N+255)/256, 256>>>(d_a, d_b, d_c, N);
printf("Vector addition successful!\n");
return 0;
}
In Cursor:
"Run vector_add.cu on GPU"
The server automatically:
vector_add.cu as entry file-O3 --use_fast_math"Not authenticated with Modal"
modal token new
"Function not found"
modal app list to verify deploymentWant to see what's happening?
MIT
infatoshi
FAQs
Pure Node.js Model Context Protocol server for GPU CUDA compilation via Modal. No Python dependency - just install and run CUDA code on Modal GPUs.
We found that @infatoshi/gpu-mcp-server demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Company News
Socket won two 2026 Reppy Awards from RepVue, ranking in the top 5% of all sales orgs. AE Alexandra Lister shares what it's like to grow a sales career here.

Security News
NIST will stop enriching most CVEs under a new risk-based model, narrowing the NVD's scope as vulnerability submissions continue to surge.

Company News
/Security News
Socket is an initial recipient of OpenAI's Cybersecurity Grant Program, which commits $10M in API credits to defenders securing open source software.