NVIDIA Logo

NVIDIA

Senior Software Engineer, Observability and AIOps

Posted 22 Days Ago
Be an Early Applicant
Remote
5 Locations
168K-322K
Expert/Leader
Remote
5 Locations
168K-322K
Expert/Leader
Develop a smart network infrastructure for NVIDIA leveraging AIOps, machine learning, and automation for efficient operations and service integration.
The summary above was generated by AI

Imagine a world where the network is self-managed, and self-healing, and requires minimal manual intervention to sustain business operations. A world where the network learns from past events to recommend actions to users. Or better yet, a network that proactively prevents actions with high probability of causing disruption. This network is advanced and intelligent where disruptions are minimized and emerging technology is easily integrated to maintain a first-class service for our business. If that sounds exciting, NVIDIA is looking for a Network Software Engineer to develop a smart network infrastructure.

The goal is to craft a reliable, scalable and efficient network to support NVIDIA software development workflows and tools, including CI/CD pipelines, compute resource management flow and developer productivity tools. The network is serving the needs across the whole software stack for NVIDIA from Graphics Drivers to Autonomous Vehicles to Deep Learning frameworks. To achieve this goal, we are looking for an engineer who has a deep understanding of L3 underlay and overlay networks, outstanding design skills and a track record in automating and delivering large-scale networks.

What you'll be doing:

  • Lead the design, development, testing, and deployment of an AIOps platform

  • Apply machine learning, deep learning, natural language processing, and other AI techniques to solve network operations challenges such as anomaly detection, root cause analysis, incident management, and automation

  • Improve network operations by defining and measuring AIOps metrics such as accuracy, reliability, scalability, performance, and efficiency

  • Experience in implementing observability principles and practices such as monitoring, logging, tracing, and alerting

  • Deep Knowledge in data science engineering such as data collection, data cleaning, data analysis, data modeling, and data visualizations

  • Build services to automate monitoring and triaging activities and provide critical information to facilitate response and resolution of performance issues and incidents

  • Build automation which recognizes, troubleshoots, and analyzes system disruptions and develop solutions for improved reliability

  • Owning and driving integrations with various service APIs such as Cloud Service Providers, to automate creation of environments and auto populate data sources in turn. Breakdown targeted manual processes into reusable software modules that can be integrated as code

What we need to see:

  • 10+ years of network architecture and automation experience

  • PhD or equivalent experience plus proven track record in architecting and automating large scale enterprise grade networks for several types of organizations.

  • Familiarity and hands-on experience with Arista, Fortinet, Juniper, and Mellanox

  • Strong track record of implementing network services in a variety of distributed computing environments

  • Hands-on experience with high performance network and network optimization in highly-available, large-scale, multi-site, international environments

  • Hands-on experience with contributing to tooling and automation for provisioning, monitoring, and managing network infrastructure

  • Must be able to read, write and review automation code (Python, Bash, SQL, etc.) Uses independent judgment & a high level of innovation to set company-level technology strategies & processes to accomplish objectives

  • Must have strong interpersonal and organizational skills, including the ability to meet deadlines, work in a team environment, follow written policies and procedures, and maintain superior customer service at all times

We have some of the most forward-thinking people in the world working for us and, due to unprecedented growth, our business development teams are rapidly growing. If you're creative and autonomous with a real passion for your work, we want to hear from you.

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

The base salary range is 168,000 USD - 322,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Arista
Bash
Ci/Cd
Data Analysis
Data Visualization
Deep Learning
Fortinet
Juniper
Machine Learning
Mellanox
Natural Language Processing
Python
SQL

Similar Jobs

An Hour Ago
Remote
United States
Senior level
Senior level
Cloud • eCommerce • Enterprise Web • Information Technology • Software
Lead the engineering teams developing SDKs for iOS, Android, and Flutter apps. Oversee software development, testing, and release management, ensuring high-quality integration into client applications while driving team performance and customer relations.
Top Skills: AndroidFlutteriOSSdksSoftware Development
An Hour Ago
Remote
United States
160K-195K Annually
Senior level
160K-195K Annually
Senior level
Cloud • eCommerce • Enterprise Web • Information Technology • Software
Develop and maintain iOS SDKs, ensuring code quality through testing, and collaborate on product development, while supporting customer interactions.
Top Skills: AppiumFlutterObjective-CReact NativeSwiftXctest
An Hour Ago
Remote
USA
129K-152K Annually
Junior
129K-152K Annually
Junior
Cloud • Fintech • Cryptocurrency • NFT • Web3
Join Coinbase as a Software Engineer to build crypto-forward products, solve complex technical challenges, and contribute to the financial future using blockchain technology.
Top Skills: AWSDockerGoMongoDBPostgresRuby on RailsReact NativeRuby

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account