Position is for an SRT (Site Reliability Tools) Lead Engineer to architect, design, build, and support monitoring technologies from known vendors and bespoke solutions based on Node.js, Python, Java, .NET, Lambda, etc.
Work alongside other Cloud, SRT, and SRE engineers to help create and maintain a monitoring framework for a hybrid AWS and on-premise environment.
Work with multiple vendor technologies and various best-of-breed SaaS and enterprise cloud management tools/products (Splunk, New Relic, DataDog, Opsgenie, AWS, etc.).
Solve problems using a software development and continuous improvement mindset.
Design and implement reliable, scalable, and well-structured monitoring strategies and automation for cloud native applications.
Serve as team backup Scrum Master.
Implement architecture models and standards with direction-setting support.
Define standards for designing, building, and maintaining systems, system/application components, and common services, including the recommended languages and tools.
Partner effectively with the product teams to manage scope and deliverables for the technical side of the product roadmap.
Identify areas of strategic technical debt, provide cost/benefit analysis for eliminating this debt, and suggest timelines for prioritizing it.
Introduce new technologies that will make the team and its output more efficient. This should include, but is not limited to, cross-knowledge training.
Candidate should own and promote technologies and intimately know requirements to make multiple projects successful.
Document infrastructure and design decisions, and explain those decisions to both business and technical owners.
Establish strong relationships with all internal stakeholders and service owners to ensure they are providing correct information to configure alerts.
Translate business use cases into operational dashboards and queries.
Identify gaps in the visibility provided by collected logs and make recommendations for additional logging.
Work on creating and implementing application monitoring and logging strategies using tools like New Relic and Splunk.
Build and maintain infrastructure on AWS.
Build tools and services from scratch to fill the existing technology gaps.
Work on troubleshooting application integration issues with development teams.
Remain up to date on industry trends and technologies, and facilitate their application where appropriate.
Skills & Experience:
Possesses a solid understanding of modern web application architecture, TCP/IP, HTTP, and complex cloud network and security topologies.
Strong knowledge of monitoring tool architecture, implementation, and integrations in hybrid data center and multi-cloud environments.
Hands-on experience administering Linux systems.
Scripting experience with Shell, Python, or Ruby.
Experience with AWS services such as EC2, VPC, RDS, CloudWatch, CloudFront, Route 53, etc.
Familiarity with SQL and relational databases (PostgreSQL, MySQL) and NoSQL databases (MongoDB, Redis), as well as AWS native RDS and DynamoDB.
Ability to use a wide variety of open source technologies and cloud services.
Willingness to learn and build new tools from scratch.
Strong knowledge of both operational monitoring services and executive dashboards using Splunk and New Relic services.
Experience turning custom and unique requirements into reliable, scalable solutions, and in writing and tuning those solutions.
Ability to dissect a set of business requirements and translate them into technical requirements, as well as identify the areas or technologies critical to making projects efficient and successful.
Experience using Splunk Enterprise Security, in-depth knowledge of Enterprise Security capabilities, and familiarity with Splunk's new User Behavior Analytics (UBA) and Security Orchestration, Automation and Response (SOAR) offerings.
Understands and follows best practices for code promotion across the various environments (builds, approvals, release); understands SDLC and Agile/iterative practices.
Additional cloud or monitoring relevant certifications and experience are advantageous but not required.
Dow Jones, Making Careers Newsworthy
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, protected veteran status, or disability status. EEO/AA/M/F/Disabled/Vets.
Dow Jones is committed to providing reasonable accommodation for qualified individuals with disabilities in our job application and/or interview process. If you need assistance or accommodation in completing your application due to a disability, please reach out to us at TalentResourceTeam@dowjones.com. Please put "Reasonable Accommodation" in the subject line.
Business Area: TECHNOLOGY - STS
Job Category: IT Development Group
Dow Jones is a global provider of news and business information, delivering content to consumers and organizations around the world across multiple formats, including print, digital, mobile and live events. Dow Jones has produced unrivaled quality content for more than 125 years and today has one of the world’s largest news gathering operations globally. It produces leading publications and products including the flagship Wall Street Journal, America’s largest newspaper by paid circulation; Factiva, Barron’s, MarketWatch, Financial News, DJX, Dow Jones Risk & Compliance, Dow Jones Newswires, and Dow Jones VentureSource. Dow Jones is a division of News Corp (NASDAQ: NWS, NWSA; ASX: NWS, NWSLV).
If you are a current employee at Dow Jones, do not apply here. Please go to the Career section on your Workday homepage and view "Find Jobs - Dow Jones." Thank you.
Req ID: 17800