
Seeking motivated IT professionals to join our expanding Automotive IT Services team & Hyundai Motor Group affiliates ๐
About the Company
Hyundai AutoEver America (HAEA) is a technology subsidiary of Hyundai Motor Group, providing advanced IT services and digital solutions across North America. Established in 2005 and headquartered in Orange County, California, the company plays a key role in driving innovation and operational efficiency within Hyundai Motor Group. With over 4,000 IT professionals across 23 subsidiaries and eight countries, HAEA continues to build the digital backbone for one of the worldโs leading automotive organizations.
About the Role
Hyundai AutoEver America is seeking a Senior or Lead Platform Engineer / Site Reliability Engineer (SRE) / Hadoop Administrator to manage and enhance a petabyte-scale, on-premises data platform built on the open-source Hadoop ecosystem. This role requires a hands-on technical leader with deep expertise in distributed systems, strong infrastructure knowledge, and a proven ability to ensure performance, reliability, and scalability.
Responsibilities
- Manage end-to-end infrastructure for a large-scale, Hadoop-based data platform ensuring high reliability and availability.
- Design, develop, and maintain components including Hadoop, Hive, Spark, NiFi, Iceberg, ELK, OpenSearch, and Ambari.
- Automate deployments, monitoring, and infrastructure management using CI/CD pipelines and scripting.
- Implement and maintain security policies, access controls, and compliance standards.
- Perform upgrades, patching, and performance optimization across platform services.
- Enhance observability through tools such as Prometheus, Grafana, and OpenTelemetry.
- Monitor system health, resolve incidents, and perform root-cause analysis.
- Collaborate with data engineering and analytics teams to align infrastructure capabilities with business needs.
- Lead technical discussions, mentor junior engineers, and advocate DevSecOps best practices.
- Drive operational excellence by improving reliability, automation, and scalability.
Required Skills
- Bachelorโs degree in Computer Science, Engineering, or related discipline.
- 10+ years of experience in Platform or Site Reliability Engineering with focus on distributed Hadoop infrastructure.
- Expertise in Hadoop ecosystem components (HDFS, YARN, Hive, Spark, NiFi, Ambari, Iceberg).
- Strong Linux administration skills (CentOS/Rocky), including system tuning and optimization.
- Proficiency with Docker, Kubernetes, and Infrastructure as Code tools (GitLab CI/CD, Python, bash).
- Experience with observability tools (Prometheus, Grafana, OpenTelemetry).
- Strong understanding of networking, security, and data compliance standards.
- Proven leadership, communication, and problem-solving abilities.
Preferred Qualifications
- Relevant certifications (Cloudera, Hortonworks, or equivalent).
- Experience managing petabyte-scale data platforms and disaster recovery implementations.
- Familiarity with data governance and metadata management.