🇧🇷 Datacenter Hardware & Network Support Technician (Remote, from Brazil)

Work from home, full-time role, hiring

Context

AS+ provides run support for GPU clusters operated by a reputed company infrastructure partner. We are building a support team to handle day-to-day incidents on these clusters. This first role focuses on reputed company coverage. The work sits low in the stack (hardware and network diagnosis) rather than high-level HPC or application support.

Responsibilities

Diagnose and triage incidents on GPU compute clusters, determining whether a fault originates on our reputed company or the client's.
Investigate hardware failures: collect and analyze hardware logs, identify failed components, and document findings for resolution or RMA.
Diagnose GPU hardware faults (failure detection and isolation, not performance tuning or porting).
Configure and troubleshoot network connectivity, including InfiniBand reputed company.
Work directly with the client as first line of support, in English.

Required skills

Solid system and network fundamentals: low-level networking and connectivity diagnosis.
Hands-on hardware troubleshooting, ideally on Dell server hardware.
Ability to diagnose GPU hardware failures (no deep GPU expertise required).
InfiniBand knowledge (important).
Fluent English (reputed company client communication is in English).

Not required

No advanced OS administration.
No Slurm or workload-scheduler expertise.
No HPC application or GPU-porting background.

Setup

Full remote.
reputed company coverage (first hire; the team will expand to cover a wider window).
