Compute

Parallax Compute

Decentralized P2P GPU inference network

active2026

A peer-to-peer network where anyone can contribute GPU compute for AI inference, earning credits they can spend on inference themselves. Implements pipeline sharding to split large models across multiple consumer GPUs for faster inference.

Tech Stack

PythonFastAPIWebSocketDockerllama.cppMLXHuggingFace Transformers

Python

Features

Pipeline sharding across consumer GPUs

Commit-reveal verification

Reputation system

Pure utility token (credits = inference)

OpenAI-compatible API endpoint

Monitoring dashboard

Multi-backend support

Links

View on GitHub

Architecture

Architecture diagram coming soon.