Full Time
Fountain Valley, CA
Posted 3 months ago

Hyundai AutoEver America

Seeking motivated IT professionals to join our expanding Automotive IT Services team and Hyundai Motor Group affiliates 🌐

About the Company

Hyundai AutoEver America (HAEA) is a technology subsidiary of Hyundai Motor Group, providing advanced IT services and digital solutions across North America. Established in 2005 and headquartered in Orange County, California, the company plays a key role in driving innovation and operational efficiency within Hyundai Motor Group. With over 4,000 IT professionals across 23 subsidiaries and eight countries, HAEA continues to build the digital backbone for one of the world’s leading automotive organizations.

About the Role

Hyundai AutoEver America is seeking a Senior or Lead Platform Engineer / Site Reliability Engineer (SRE) / Hadoop Administrator to manage and enhance a petabyte-scale, on-premises data platform built on the open-source Hadoop ecosystem. This role requires a hands-on technical leader with deep expertise in distributed systems, strong infrastructure knowledge, and a proven ability to ensure performance, reliability, and scalability.

Responsibilities

Manage end-to-end infrastructure for a large-scale, Hadoop-based data platform, ensuring high reliability and availability.
Design, develop, and maintain components including Hadoop, Hive, Spark, NiFi, Iceberg, ELK, OpenSearch, and Ambari.
Automate deployments, monitoring, and infrastructure management using CI/CD pipelines and scripting.
Implement and maintain security policies, access controls, and compliance standards.
Perform upgrades, patching, and performance optimization across platform services.
Enhance observability through tools such as Prometheus, Grafana, and OpenTelemetry.
Monitor system health, resolve incidents, and perform root-cause analysis.
Collaborate with data engineering and analytics teams to align infrastructure capabilities with business needs.
Lead technical discussions, mentor junior engineers, and advocate DevSecOps best practices.
Drive operational excellence by improving reliability, automation, and scalability.

Required Skills

Bachelor’s degree in Computer Science, Engineering, or related discipline.
10+ years of experience in Platform or Site Reliability Engineering with a focus on distributed Hadoop infrastructure.
Expertise in Hadoop ecosystem components (HDFS, YARN, Hive, Spark, NiFi, Ambari, Iceberg).
Strong Linux administration skills (CentOS/Rocky), including system tuning and optimization.
Proficiency with Docker, Kubernetes, and Infrastructure as Code and automation tooling (GitLab CI/CD, Python, Bash).
Experience with observability tools (Prometheus, Grafana, OpenTelemetry).
Strong understanding of networking, security, and data compliance standards.
Proven leadership, communication, and problem-solving abilities.

Preferred Qualifications

Relevant certifications (Cloudera, Hortonworks, or equivalent).
Experience managing petabyte-scale data platforms and disaster recovery implementations.
Familiarity with data governance and metadata management.
