Senior Big Data Engineer in Work From Home (Remote) at Yahoo Inc

Date Posted: 5/17/2024

Job Description

Yahoo Mail is the ultimate consumer inbox with hundreds of millions of users. It’s the best way to access your email and stay organized from a computer, phone, or tablet. With its beautiful design and lightning-fast speed, Yahoo Mail makes reading, organizing, and sending emails easier than ever.

A Little About Us

The Yahoo Mail engineering team develops solutions powering our mail brands, including a next-generation infrastructure that we are moving entirely to a native public cloud architecture. The Mail Intelligence AI/ML platform is responsible for building intelligent capabilities at scale to discover interests, reveal habits, and deeply personalize user journeys for Yahoo Mail and across the entire Yahoo ecosystem.

We are looking for innovative, entrepreneurial, and passionate engineers. We strive to deliver only the absolute best to our users and are willing to meticulously refine the details to achieve this goal. While engineering is a core piece of the puzzle, we believe that your passion and ownership mindset are as crucial as the high engineering standards, code quality, and world-class architectural skills that we expect from our engineering teams.

We process billions of mail messages using cutting-edge algorithms in areas including, but not limited to, natural language processing, GenAI, large language models, machine learning, and big data processing on the order of petabytes. We use them to extract information, build mail content and user knowledge, and interconnect different sources to identify, highlight, and amplify what matters.

Our work spans many technical challenges that are highly rewarding and fulfilling for high-caliber engineers hungry for impactful problems.

You will build tools and workflows to make it easier to manage and act on this vast information. You will also be working on AI-based data infrastructure, supporting new functionalities on existing platforms, and mining data for analytics insights and product features.

Our Hadoop clusters are among the largest in the world, at double-digit petabyte scale. Developing this infrastructure presents many technical challenges in the areas of efficient query processing, large-scale data processing, machine learning and modeling, as well as satisfying complex business rules.

If you are someone who is passionate about harnessing data at insane scale, enjoys working with new technologies, setting up petabyte data infrastructures and implementing new machine learning solutions and metrics systems, we want to hear from you!

Your Day: 

  • Research and develop innovative algorithms for information retrieval, processing, and ranking.

  • Take end-to-end ownership of machine learning-based distributed data systems, with a focus on data pipelines for data collection, validation, active learning, and batch inference.

  • Work with other engineers to implement algorithms and systems efficiently.

  • Work with data analysts, data scientists, product managers, and software engineers to understand business problems and technical requirements and to deliver data solutions.

  • Lead data investigations to troubleshoot issues that arise along the data pipelines.

  • Maintain and improve released systems.

  • Provide engineering consulting on large and complex warehouse data.

Qualifications:

  • B.S. with 7+ years of relevant industry experience, or M.S. in Computer Science with 5+ years of relevant industry experience. Ideally a Computer Science graduate with a specialization in Data Engineering or Machine Learning.

  • Experience in Hadoop technologies (Map/Reduce, Oozie, Pig, Hive, Spark, Kafka, HBase, Storm).

  • Strong fundamentals: algorithms, distributed computing, data structures, databases.

  • Fluency with at least one of: Java, Python, C++.

  • Self-driven, challenge-loving, detail-oriented, and team-spirited, with excellent communication skills and the ability to multitask and manage expectations.

Nice to have:

  • Experience in any of: machine learning, analytics, data mining, or data marts and warehouses.

  • Experience with Deep Learning platforms (TensorFlow/Keras/Spark MLlib) and SQL/Unix/Shell.

  • Experience with machine learning algorithms, NLP, and/or statistical methods is a big plus.

Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form or call 408-336-1409. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.

At Yahoo, we know that diversity makes us stronger. We are committed to a collaborative, inclusive environment that encourages authenticity and fosters a sense of belonging. We strive for everyone to feel valued, connected, and empowered to reach their potential and contribute their best. Check out our diversity and inclusion page to learn more.

The compensation for this position ranges from $128,250.00 to $266,875.00/yr and will vary depending on factors such as your location, skills, and experience. The compensation package may also include incentive compensation opportunities in the form of a discretionary annual bonus or commissions, in addition to equity incentives. Yahoo provides industry-leading benefits including healthcare, a 401(k) savings plan, company holidays, vacation, sick time, parental leave, and an employee assistance program. Eligibility requirements apply.

Yahoo has a high degree of flexibility around employee location and hybrid working. In fact, our flexible-hybrid approach to work is one of the things our employees rave about. Most roles don’t require specific regular patterns of in-person office attendance. If you join Yahoo, you may be asked to attend (or travel to attend) on-site work sessions, team-building, or other in-person events. When these occur, you’ll be given notice to make arrangements. 

If you’re curious about how this factors into this role, please discuss with the recruiter.

Currently work for Yahoo? Please apply on our internal career site.
