As a key member of our global development team, you will:
Mentorship & Talent Development:
Act as a trusted advisor and coach for mid-level developers and analysts, providing guidance, fostering skill development, and judiciously allocating work to maximize team potential and project success.
Provide technical guidance, mentorship, and code reviews to junior data engineers, fostering a culture of excellence and continuous improvement.
Operational Excellence: Ensure adherence to best practices and essential procedures.
Autonomy & Ownership: Operate with a high degree of independence and judgment, taking ownership of critical initiatives and driving them to successful completion.
Risk Management: Proactively assess and manage technical risks, demonstrating a strong commitment to regulatory compliance, ethical judgment, and transparent reporting of control issues.
Design, develop, and maintain robust, scalable, and high-performance data pipelines using PySpark.
Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver high-quality data solutions.
Optimize and tune Spark jobs for performance and efficiency.
Implement data quality checks and ensure data integrity across all data pipelines.
Data Architecture & Design: Design, develop, and optimize data architectures, pipelines, and data models to support various business needs, including analytics, reporting, and machine learning.
ETL/ELT Development (Python/PySpark Focus): Build, test, and deploy highly scalable and efficient ETL/ELT processes using Python and PySpark to ingest, transform, and load data from diverse sources into data warehouses and data lakes. Develop and optimize complex data transformations using PySpark.
Data Quality & Governance: Implement best practices for data quality, data governance, and data security to ensure the integrity, reliability, and privacy of our data assets.
Performance Optimization: Monitor, troubleshoot, and optimize data pipeline performance, ensuring data availability and timely delivery, particularly for PySpark jobs.
Qualifications:
We are seeking a talented individual with:
Experience: 6-10 years of progressive experience in systems analysis and programming of software applications, with a proven track record of implementing successful projects.
Technical Expertise :
Strong proficiency in Java application technologies, including deep experience with TDD (Test-Driven Development), Spring framework, and Microservices architecture.
Extensive hands-on experience with PySpark and advanced Python programming skills.
Proven experience with Big Data ecosystems, including Cloudera and/or Data Bricks.
Hands-on experience with distributed query engines like Starburst (Trino/Presto).
Proficient in designing and managing complex workflows using scheduling tools, particularly Apache Airflow.
Strong expertise in SQL and experience with relational and non-relational databases
Excellent knowledge of algorithms and data structures, design patterns. Experience in systems analysis and programming of software applications
Strong Java experience : Java core, collections, concurrency, streams
Frameworks and APIs: Spring (Core, Batch, Integration, MVC, Boot, Data), Hibernate, Jackson , JAX RS, JPA, JAXB
Experience with distributed caches like Apache Gem fire will be a plus
Messaging: JMS, Kafka
Experience in Angular 21+ / ReactJS
Testing: JUnit, Mocking frameworks (Mockito, Power Mock)
Experience in performance enhancements using parallel processing, multithreading. Understanding locking/synchronization
Understanding Docker and Kubernetes
Experience in RESTful API development and integration, deployment framework and source control experience such as Git.
Solid understanding and experience with SQL.
Proficiency in Linux environments.
Experience with job scheduling.
Methodology: Working knowledge of project management techniques and methods, with a focus on agile methodologies.
Adaptability: Ability to thrive in a fast-paced environment, manage multiple deadlines, and adapt quickly to evolving requirements and priorities.
Collaboration: A strong team player with excellent communication skills, capable of working effectively with global teams to deliver integrated solution
Experience with real-time data streaming and processing using PySpark Structured Streaming.
Knowledge of machine learning concepts and MLOps practices, especially integrating ML workflows with PySpark.
Familiarity with data visualization tools (e.g., Tableau, Power BI).
Contributions to open-source data projects.
Strong experience with SQL and NoSQL databases (e.g., PostgreSQL, MySQL, MongoDB, Cassandra).
Preferred:
Experience with AI development tools (eg. Copilot, Devin & Claude)
Prior experience or a keen interest in the financial services industry
Ability to work under pressure and manage deadlines or unexpected changes in expectations or requirements
Experience of working in fast paced environment
Flexible and adaptive, team player
Excellent analytical and communication, interpersonal skills.
Education:
Bachelor’s degree/University degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
-
Technology
-
Applications Development
-
Full time
-
Irving Texas United States
-
$125,760.00 - $188,640.00
In addition to salary, Citi’s offerings may also include, for eligible employees, discretionary and formulaic incentive and retention awards. Citi offers competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, and disability insurance; and wellness programs. Citi also offers paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays. For additional information regarding Citi employee benefits, please visit citibenefits.com. Available offerings may vary by jurisdiction, job level, and date of hire.
-
Please see the requirements listed above.
-
For complementary skills, please see above and/or contact the recruiter.
-
Jun 30, 2026
-
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.