Data Curator/Engineer

VMware - Atlanta, GA4.1

Full-timeEstimated: $100,000 - $140,000 a year
Global Services Digital Platform
Data Curator/Engineer
The Digital Platform team is a key component of VMware’s support organization that provides an enriched and valuable ecosystem of data and analytics that drive innovation for our customers and support engineers globally. Data is the most valuable asset in VMware and we're dedicated to develop data pipelines for helping our customers.

Position Summary:
VMware is seeking a Data Curator/Engineer to enable the data science and machine learning teams. The position works with the Senior Manager, Digital Platform Architecture.

The data curation role will “own” the data that we require to deliver predictive and proactive support and services. You will help harmonize incoming data from major sources including Skyline (a real-time feed of customer data center topology, configuration, and events), VMware SaaS products, and improve these feeds with user behaviour and profile information to deliver next-generation support. It involves curating data for different analytical tasks, to plan resources for accelerating data analysis, to add semantic meaning to a data catalog, to blend data sets together, and to organize project areas for teams of data analysts and data scientists to work together more effectively.

Your major areas of focus will help enable: the “self-driving data center,” “faster forensics” for our support teams, and the data for our “personalized platform.”

You will be responsible for lending understanding to the huge amounts of data that will drive our next-generation customer support experience.

Primary responsibilities include:
Provide thought leadership for harmonizing data across multiple business units and products. For example, from telemetry feeds of many VMware products, provide a single view of a customer without duplicating data.
Provide ownership of all data “types” that are involved in providing an amazing customer experience – advise product managers on potential high-value uses of available data about a customer environment or specific support users.
Maintaining the data catalog/dictionary of all inbound data to prevent duplication of data types and to enable data analysis with the right context. The dictionary will be used across many product units and development teams.
Ensure data is in a format that's consumable for feature-extraction for machine learning and data science (streaming and batch analytics)
You will be involved in proof-of-concept projects to build proven value from the ingested data.

Key Responsibilities include:
Drive internal proof of concept initiatives. When needed, quickly design and implement a prototype that can be shown to internal product or platform teams. This would include hands-on coding to extract or manipulate data as required.
Establish relationships with key architects across technology organizations and collaborate on harmonizing data and data pipelines.
Work with the CMBU (Cloud Management Business Unit) to drive toward the “self-driving data center.”
Standardizing metadata is hard, but you’ll fix that.
Experienced with a minimum of 5+ years in data engineering
Integration level design/coding experience/skills using Python, R, Java/Scala.
Familiar with TensorFlow, Keras, Torch and designing pipelines for these.
Familiar with data at scale
SQL/noSQL technology
Familiar with Database, ETL and BI technologies
Strong at technical goal setting for a project with actionable success metrics. Good knowledge and experience on measuring a service from user experience angle.
Strong at identifying problems, solving complex problems with simple solutions.
Think strategically: See patterns and relationships in information and events; clarify and simplify complex information; anticipate trends and possibilities that may lead to new business opportunities; consistently think and act “ahead of the curve;” anticipate and effectively respond with urgency to immediate opportunities; executes plans vigorously and with flexibility; operate actively; identify and address long-term opportunities.
Strong on working towards results and self-motivated, strong learning mindset, with deep understanding of related advanced/new technology. Keep up with the technology development in the related areas in the industry.
Good verbal, written, presentation, facilitation, and interaction skills, including ability to effectively communicate architectural issues and concepts to multiple organization levels and executive management.
Equal Employment Opportunity Statement
VMware is an Equal Opportunity Employer and Prohibits Discrimination and Harassment of Any Kind: VMware is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. All employment decisions at VMware are based on business needs, job requirements and individual qualifications, without regard to race, color, religion or belief, national, social or ethnic origin, sex (including pregnancy), age, physical, mental or sensory disability, HIV Status, sexual orientation, gender identity and/or expression, marital, civil union or domestic partnership status, past or present military service, family medical history or genetic information, family or parental status, or any other status protected by the laws or regulations in the locations where we operate. VMware will not tolerate discrimination or harassment based on any of these characteristics. VMware encourages applicants of all ages. VMware will provide reasonable accommodation to employees who have protected disabilities consistent with local law.