Staff Software Engineer – High Performance Computing & Machine Learning Infrastructure Development (Remote Eligible)
Join arenaflex and Shape the Future of Cloud Computing
Are you ready to be part of a transformative journey in cloud computing and high-performance infrastructure? At arenaflex, we're not just building technology—we're redefining how the world connects, analyzes, and leverages data at an unprecedented scale. As a Staff Software Engineer specializing in High Performance Computing (HPC) and Machine Learning (ML) infrastructure, you'll play a pivotal role in developing cutting-edge solutions that power some of the most demanding computational workloads in the industry today.
Our engineering teams bring together talented minds from diverse backgrounds who share a common passion: solving complex technical challenges that impact billions of users globally. We believe in pushing boundaries, thinking differently, and continuously innovating to deliver world-class cloud infrastructure that sets the standard for performance, reliability, and scalability.
About the Role
As a Staff Software Engineer at arenaflex, you'll be at the forefront of our cloud infrastructure development, focusing on optimizing HPC and ML workloads across our global platform. This is a unique opportunity to work on systems that handle massive-scale data processing, machine learning training pipelines, and distributed computing challenges that push the limits of modern technology.
You'll collaborate with talented engineers across multiple teams, architecting solutions that integrate deeply with our cloud platform's foundational layers—from kernel optimization to user-space communication libraries. Your work will directly impact the performance and capabilities available to enterprise customers, researchers, and developers who rely on our infrastructure for their most critical workloads.
What You'll Do
As a key member of our engineering team, you'll be responsible for:
- Full-Stack Optimization: Optimize HPC and ML performance across our cloud platform infrastructure, including kernel-level optimizations, user-space communication libraries (such as MPI, libfabric, and NCCL), and client HPC and ML applications to achieve maximum efficiency and throughput.
- Solution Development: Design, build, and deploy HPC and ML solutions on our cloud platform, ensuring seamless integration with existing infrastructure while maintaining high standards for performance and reliability.
- Technical Leadership: Provide technical direction and mentorship to a team of engineers, establishing coding standards, best practices, and architectural guidelines that drive consistency and quality across the organization.
- System Architecture: Architect and implement large-scale distributed systems that can handle petabyte-scale data processing, ML training workloads, and high-throughput computing scenarios.
- Cross-Functional Collaboration: Work closely with product managers, other engineering teams, and customer-facing specialists to understand requirements and deliver solutions that exceed expectations.
- Innovation and Research: Stay current with emerging technologies in HPC, ML frameworks, cloud infrastructure, and distributed systems, evaluating new approaches and incorporating beneficial innovations into our platform.
- Performance Analysis: Conduct thorough performance analysis and benchmarking to identify bottlenecks, optimize resource utilization, and ensure our platform delivers industry-leading performance metrics.
Who We're Looking For
We're seeking an experienced engineer who thrives on technical challenges and has a proven track record of delivering distributed systems at scale. The ideal candidate combines deep technical expertise with strong leadership abilities and a passion for innovation.
Basic Qualifications
- Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience demonstrating strong technical fundamentals).
- Minimum of 2 years of experience in software development with demonstrated proficiency in data structures, algorithms, and software design principles.
- At least 2 years of experience building and scaling large-scale distributed systems, wide-reaching frameworks, or networking infrastructure.
- Strong understanding of operating system concepts, memory management, and system-level programming.
- Experience with performance optimization and profiling of compute-intensive applications.
Preferred Qualifications
- Advanced degree (Master's or PhD) in Computer Science, Engineering, or a related technical field.
- Experience with C/C++ programming and development of kernel-level components or system drivers.
- Deep expertise in Linux kernel internals, memory management, and operating system optimization.
- Experience with Linux device drivers, networking stacks, and OS tuning and packaging.
- Knowledge of High Performance Computing (HPC) and Machine Learning communications, including MPI (Message Passing Interface), collective communication libraries, libfabric, and parallel programming.
- Familiarity with remote direct memory access (RDMA) technologies and InfiniBand networks.
- Experience optimizing ML training workloads and familiarity with popular ML frameworks.
- Understanding of cloud infrastructure components and their optimization for compute-heavy workloads.
- Strong debugging and troubleshooting skills for complex, multi-layered systems.
Skills and Competencies
To succeed in this role, you'll need:
- Technical Depth: Strong foundation in computer science fundamentals, including data structures, algorithms, distributed systems, and operating system concepts.
- Programming Expertise: Proficiency in systems programming languages (C/C++) and experience with scripting languages for automation and tooling.
- System Design: Ability to architect and design scalable, performant distributed systems that meet demanding requirements.
- Performance Optimization: Skills in profiling, benchmarking, and optimizing system performance at various levels of the stack.
- Communication: Excellent verbal and written communication skills, with the ability to explain complex technical concepts to diverse audiences.
- Leadership: Demonstrated ability to mentor junior engineers and provide technical guidance to team members.
- Problem-Solving: Strong analytical and debugging skills, with a methodical approach to troubleshooting complex issues.
- Collaboration: Experience working effectively in cross-functional teams and fostering positive working relationships.
Career Growth and Learning Opportunities
At arenaflex, we invest heavily in the growth and development of our employees. As a Staff Software Engineer, you'll have access to:
- Technical Career Ladder: Clear progression paths for individual contributors who want to deepen their technical expertise while taking on increasing scope and impact.
- Mentorship Programs: Receive guidance from senior engineers and mentor junior team members, fostering a culture of knowledge sharing and growth.
- Professional Development: Access to conferences, training programs, certifications, and internal technical talks featuring industry experts.
- Internal Mobility: Opportunities to explore different teams and projects as your interests and our business evolve, with support for internal transfers and role transitions.
- Cutting-Edge Projects: Exposure to some of the most advanced technical challenges in the industry, working with technologies at the forefront of cloud computing.
Our fast-paced environment ensures that you'll continuously learn and grow. You'll have the opportunity to switch teams and projects as you and our business evolve, providing diverse experiences that accelerate your career development.
Work Environment and Culture
At arenaflex, we believe that great engineering happens when diverse perspectives come together. Our culture is built on:
- Innovation First: We encourage creative thinking and experimentation, understanding that breakthroughs often come from trying new approaches.
- Collaboration Over Competition: We succeed together, sharing knowledge and supporting each other in solving complex challenges.
- Work-Life Balance: We offer flexible work arrangements, including remote work options, to help you maintain balance while delivering your best work.
- Inclusive Environment: We value diversity and create an inclusive workplace where every voice matters and everyone can thrive.
- Technical Excellence: We're passionate about building quality into everything we do, from initial design through production deployment and beyond.
Compensation and Benefits
We offer competitive compensation packages that reflect the value you bring to the team. Our benefits include:
- Industry-competitive salary with annual performance-based bonuses
- Equity compensation to share in arenaflex's success
- Comprehensive health, dental, and vision insurance
- Retirement savings plan with generous company matching
- Flexible paid time off and parental leave policies
- Wellness programs and fitness center membership reimbursement
- Professional development budget for courses, conferences, and certifications
- Relocation assistance for eligible candidates
- Remote work flexibility with home office setup support
Ready to Make an Impact?
If you're excited about pushing the boundaries of what's possible in cloud computing and want to work with a team of passionate engineers on problems that matter, we want to hear from you. Join arenaflex and be part of a team that's transforming how businesses use technology to solve their most critical challenges.
At arenaflex, our mission is to provide the infrastructure that empowers organizations to innovate and grow. We're looking for engineers who share this vision and are ready to contribute their skills, creativity, and enthusiasm to our mission. If you're ready for a challenging and rewarding opportunity where your work will have a real impact, apply today and take the first step toward an exciting career at arenaflex.
Don't miss this opportunity to join a team that's shaping the future of technology. Apply now and become part of something extraordinary at arenaflex!