Junior Hadoop Administrator (Apps Support Intmd Analyst )

Citi - Tampa, FL (30+ days ago)3.9


Primary Location: United States,Florida,Tampa
Education: Bachelor's Degree
Job Function: Technology
Schedule: Full-time
Shift: Day Job
Employee Status: Regular
Travel Time: No
Job ID: 18051145

Description

Job Purpose:
Hadoop administrator with strong analytical and technical ability with experience in hadoop ecosystem - hdfs,mapreduce,hive,pig,sqoop and apache spark
Responsible for in Installation and configuration of Hadoop, YARN, Cloudera manager, Cloudera BDR, Hive, HUE and MySQL applications
Able to work independently to assist different teams on POC projects.
Proactively manages hadoop system resources to assure maximum system performance by understanding business requirements.
Reviews performance stats and query execution/explain plans, and recommends changes for tuning Hive/Impala queries
Candidate should be able to enforce best practices in maintaining the environment as well as Service request management, Change request management and Incident management by using the standard tools of preference

Job Background/context:
The position is based in Tampa, Florida and is required to administer and support Big Data hadoop and analytics clusters.
Candidate will work independently and is highly self-motivated
Candidate will directly interact with the Vendor (Cloudera,Datameer and Paxata), keep himself updated with latest industry changes with respect to Hadoop and Unix systems. Candidate will have to interface with various development teams and create solutions which are Multi-Tenant and Cross LOB.
Applies skills and knowledge to develop creative solutions to meet operational excellence.
The candidate will work with complex and variable issues with substantial potential impact, weighing various alternatives and balancing potentially conflicting needs.

Qualifications

Key Responsibilities:
Installation and configuration of Hadoop, YARN, Cloudera manager, Cloudera BDR, Hive, HUE and MySQL
Implement Encryption, sentry and Upgrade complex patches across clusters.
Reviews performance stats and query execution/explain plans, and recommends changes for tuning Hive/Impala queries
Design and Build distributed, scalable, and reliable data pipelines that ingest and process data at scale and in real-time
Source huge volume of data from diversified data platforms into Hadoop platform
Perform analysis of large data sets using components from the Hadoop ecosystem
Reviews security management best practices including the ongoing promotion of awareness on current threats, auditing of server logs and other security management processes, as well as following established security standards.
Work proactively & independently to address project requirements, and articulate issues/challenges with enough lead time to address project delivery risks

Req Qualifications:
0-1 years knowledge of Big Data hadoop administration , preferably Cloudera Distribution.
0-1 years knowledge about Hadoop Architecture and HDFS , YARN, Spark, Solr ,Impala, Hive.
Familiarity with data loading tools like Flume, Sqoop, Kafka.
Knowledge of workflow/schedulers like Oozie.
Experience in optimizing and tuning Hadoop environment to meet the performance requirements
Experience with Spark .
Support analytic tools like Datameer, Platfora, R and Paxata
knowledge of Linux administration.

Mandatory Skills:
Cloudera hadoop administration
Hadoop ecosystem tools and Analytical tools integration with hadoop system.
Bachelor’s degree in Computer Science / Engineering
Experience in installation, design, capacity management, performance tuning, monitoring and scaling of Hadoop environments.
Troubleshoot Job issues, Impala queries, User Quota issues and Fix performance issues.
Ability to work independently and delivery results to meet deadlines .
Interpersonal skills to interact with global team members and clients
Excellent verbal and written communications skills