SPEAR (Scalable Platform for Edge AI Real-Time) manages the full lifecycle of AI inference at the edge. Developers define streaming pipelines that ingest sensor data, video, or logs through connectors to Kafka, MQTT, or HTTP sources. SPEAR dynamically deploys containerized models to worker nodes and balances load across clusters to keep response latency low. It includes built-in model versioning, health checks, and telemetry, and exposes metrics for Prometheus and Grafana. Users can add custom transformations or alerts through a modular plugin architecture. With automated scaling and fault recovery, SPEAR delivers reliable real-time analytics for IoT, industrial automation, smart cities, and autonomous systems in heterogeneous environments.
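
The pipeline idea described above (ingest records, apply pluggable transformations, run a model, raise alerts) can be sketched in a few lines of plain Python. This is only an illustration of the concept, not SPEAR's API: every name here (`Pipeline`, `celsius_to_kelvin`, `overheat_model`, the record fields, and the 350 K threshold) is a hypothetical stand-in, and an in-memory list takes the place of a Kafka, MQTT, or HTTP connector.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, Iterator

# All names below are hypothetical; SPEAR's real API is not shown in this document.
Record = dict
Transform = Callable[[Record], Record]
AlertRule = Callable[[Record], bool]


@dataclass
class Pipeline:
    """Toy streaming pipeline: source -> transforms -> model -> alert rules."""
    source: Iterable[Record]
    model: Callable[[Record], float]
    transforms: list[Transform] = field(default_factory=list)
    alert_rules: list[AlertRule] = field(default_factory=list)
    alerts: list[Record] = field(default_factory=list)

    def run(self) -> Iterator[Record]:
        for record in self.source:
            # Apply each plugin-style transformation in order.
            for transform in self.transforms:
                record = transform(record)
            # Attach the model output to a copy of the record.
            record = {**record, "score": self.model(record)}
            # Collect the record if any alert rule matches.
            if any(rule(record) for rule in self.alert_rules):
                self.alerts.append(record)
            yield record


def celsius_to_kelvin(record: Record) -> Record:
    """Example transformation: add a Kelvin field derived from temp_c."""
    return {**record, "temp_k": record["temp_c"] + 273.15}


def overheat_model(record: Record) -> float:
    """Placeholder for a deployed model: a fixed threshold on temperature."""
    return 1.0 if record["temp_k"] > 350.0 else 0.0


# An in-memory list stands in for a streaming connector.
readings = [
    {"sensor": "a", "temp_c": 20.0},
    {"sensor": "b", "temp_c": 90.0},
]

pipeline = Pipeline(
    source=readings,
    model=overheat_model,
    transforms=[celsius_to_kelvin],
    alert_rules=[lambda r: r["score"] >= 1.0],
)
results = list(pipeline.run())
```

Because `run` is a generator, records are processed one at a time as they arrive from the source, which mirrors the streaming behavior described above. Transformations and alert rules are ordinary callables, so adding one means appending to a list rather than changing the pipeline itself.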