Required Qualifications:
3+ years of demonstrated ability with Hive, Python, Spark/Scala, SQL, etc.
Experience with Google Cloud Platform: BigQuery, Cloud Storage, Dataproc, Dataflow, Cloud Composer, Cloud SQL, Pub/Sub, Terraform, etc.
Experience with Hadoop Ecosystem, Kafka, PCF cloud services
Familiarity with big data and machine learning tools and platforms
Experience with BI tools, such as Alteryx, DataStage, QlikSense, etc.
Design data pipelines and data robots; take a vision and bring it to life
Master data engineer; mentors others; works closely with IT architects to set strategy and design projects
Provide extensive technical and strategic advice and guidance to key stakeholders on data transformation efforts
Redesign data flows to prevent recurring data issues
Strong analytical and problem-solving skills
Excellent oral and written communication skills, as well as facilitation and presentation skills and an engaging presentation style
Ability to work both as a global team member and independently in a changing environment, and to prioritize effectively
Ability to establish and maintain coordinated and effective working relationships with application implementation teams, IT project teams, business customers, and end users.
Ability to deliver work within deadlines
Experience with agile/lean methodologies
Experience working independently and with minimal supervision
Experience with Test Driven Development and Software Craftsmanship
Experience with GitHub, AccuRev, or other version-control systems
Experience with PuTTY
Experience with DataStage
Strong communication skills
Ability to illustrate and convey ideas and prototypes effectively to the team and partners
Confident presence, with the ability to learn quickly, influence others, and shape ideas
Key Skills Required (Data Engineer):
Python / PySpark / Scala
SQL & Hive
Hadoop Ecosystem
Data Pipeline Design & ETL Development
Google Cloud Platform (BigQuery, Dataproc, Dataflow, Cloud Storage)
Kafka / Streaming Data Processing
Terraform (Infrastructure as Code)
DataStage or Similar ETL Tools
Version Control (GitHub or equivalent)
Agile Methodologies
Strong Analytical & Problem-Solving Skills
Stakeholder Collaboration & Communication
Nice to Have:
Cloud Composer, Cloud SQL, Pub/Sub
BI Tools (Alteryx, QlikSense)
Machine Learning Platform Exposure
Test Driven Development (TDD)
Mentoring & Technical Leadership