
Principal Software Engineer Lead - Microsoft Azure Edge + Platform Engineering System


Cooper Brenneman

Phone: 717-919-4274
Email: cooper.brenneman@gmail.com
Seattle, WA 98122

Summary
• 8+ Years of Software Engineering industry experience in design, development, release, and support
• 6+ Years of Software Development experience in big data, Azure, backend systems, and UI
• 2.5+ Years in Software Engineering Team Management leading various teams and problem areas
• 2+ Years of Program Management experience working on products with Enterprise and External customers
• Experience in creating new teams, defining charters, and hiring employees of diverse experience and backgrounds
• Drive Azure Service Health review meetings, covering Postmortem Incident Reviews, KPIs, and Security for 50+ services
• Create forecasting models, budgets, and reports for multimillion-dollar development expenses and charges
• Instrument telemetry and drive improvements for new and legacy services to improve completion of OS test results
• Design service, team, and business SLOs, SLIs, and SLAs used to determine the success of every core OS release

Work Experience
Principal Software Engineer Lead – Microsoft Azure Edge + Platform Engineering System (Jan 2019 – Current)
Team Creation and Management
• Created a new Engineering Team and defined the team charter, mission statements, and key outcomes
• Defined the process and structure around work tracking, issue prioritization, product support, and development practices
• Conducted over 100 interviews to fill positions across the Azure and Microsoft organization based on Azure’s business needs
• Experienced in managing engineering groups of up to thirteen Full Time Microsoft Employees and Vendors

Engineering System Telemetry Instrumentation


• Currently lead a team focused on providing real-time signals for monitoring ~36 million test results and 10 years of machine hours daily
across the Azure Engineering System Test Automation Platform
• Architected distributed telemetry instrumentation and data pipelines that monitor test orchestration, execution, and result delivery across
multiple on-prem and Azure cloud services
• Generated hundreds of automated service delivery and health incidents to decrease manual ticket creation for delivery by 300% and
improve the time to detect issues by over 200%
• Maintained telemetry pipelines consumed by partner teams and management for monitoring service health and KPIs in real time
• Created alerting tools to prioritize, investigate, and mitigate hundreds of daily issues for more effective DRI and SRE experiences
• Generate data, reports, and experiences for managing multimillion-dollar annual budgets across the Azure Engineering System team

Organizational Service Health Operations


• Drive org-wide weekly Service Health Livesite Review meetings, covering Postmortem Incident Reviews, KPIs, budget, and security
• Created the process and tools to efficiently review 100+ Postmortem Incident Reviews for the 400+ engineer organization
• Security driver for 50+ Azure Engineering System services, improving the overall security metrics month over month
• Communicate monthly service health, security, and prioritization updates to the Engineering System organization and Azure leadership

Service Health Measurement and Tracking


• Created Azure business-critical KPIs to track improvements through release-over-release comparison
• Designed Azure Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs)
• Correlated service health to KPIs and metrics to understand how outages, incidents, and problems affect our customers in real time
• Provided key insights, root cause analysis, and repair items for incidents and issues affecting top-level business KPIs and metrics

Continuous Integration Services for Azure OS Development


• Worked across organizations to design a continuous integration (CI) testing solution for Azure OS Engineering
• Collaborated with teams to create key adoption and service metrics to understand SLA, SLO, and SLI of services
• Created capacity models and budget forecasts for 2+ years of projected service growth and maintenance
• Enabled CI testing for 700+ monthly developers with a 92% overall reliability across infrastructure and product issues
Senior Software Engineer – Microsoft Windows Engineering System (May 2016 – Dec 2018)
Engineering System Quality of Service KPI Metric Design and Improvements
• Researched, designed, and evangelized the Windows Engineering System Quality of Service Metric used to understand overall Engineering
System completeness, reliability, and effectiveness for our service as seen by our customers
• Created high-level KPI reports and analyses that were presented quarterly to the Windows Corporate Leadership team
• Helped drive priority and improvements of key infrastructure and product areas to increase overall Quality of Service of the Windows
Engineering System by 500% across multiple Windows release cycles

Test Content Configuration Management


• Designed and implemented a solution to enable Windows developers to self-serve test configuration for lab-based testing
• Enabled end-to-end local developer test validation using lab-based physical and VM devices, allowing faster development cycles
• Converted 98% of required globally executed test content in the Windows 10 test system to use the new configuration model
• Defined the required criteria and process to effectively onboard new content, increasing transparency and decreasing test onboarding
duration SLA by almost 400%

Windows Test Health Measurement and Improvement


• Worked alongside Data Scientists and Program Managers to create key metrics to quantify the health of tests in Windows
• Created the Content Health Lifecycle to identify, validate, and improve non-deterministic and unhealthy test content using metric results
• Productized the data pipeline to actively identify and drive fixes for hundreds of tests, resulting in millions of clearer test signals, years of
saved machine time, and more secure testing across Windows
• Worked across teams to improve global required test reliability for Windows code validation from 69% to over 95%

Program Manager – Microsoft Windows Flighting and Feedback Team (Aug 2014 – May 2016)
Windows 10 Release Quality View and Release Metrics
• Formulated, designed, and evangelized the process to measure Microsoft’s readiness to release a new major version of the Windows OS
based on key KPIs and business goals across the organization
• Worked with teams to improve internal telemetry data pipeline efficiency, decreasing latency from 48 hours to just over 3 hours
• Helped increase Dogfood engagement by 50% for PC and 300% for Phone through accurate transparency and accountability reporting
• Audited and improved list of objective measures used to assess internal and Insider Windows OS release candidates using telemetry
• Created reports and processes that are still used today by teams, directors, and VPs to assess release readiness for upcoming Windows versions

Microsoft Feedback Hub Diagnostic and Feedback Pipeline Reliability


• Worked across multiple teams to measure the success of capturing, sending, and receiving diagnostics for the Feedback Hub App
• Created the data pipeline for developers to understand and investigate diagnostic delivery reliability and missing diagnostics
• Helped identify and prioritize issues to improve pipeline reliability from 60% to over 90% as seen by the customers

Program Manager Intern – Microsoft Windows Work Tracking (June 2013 – Aug 2013)
Bug Tracking Reporting Tool
• Designed and implemented a data pipeline that allowed teams to visualize work items for more efficient work tracking
• Added features that allowed developers and managers to create custom visualizations based on the specific needs of their team

Skills
Technical
• JavaScript
• C#
• Azure
• SQL Management
• MEAN/MERN Stack
• Python
• R
• Java
• Windows and Linux OS/Devices

Experience
• Team Creation and Charter Definition
• Hiring and Termination Experience
• Budget Forecasts and Capacity Modeling
• Work Management and Prioritization
• Contract Creation and Negotiation

Tools
• Microsoft Office
• Power BI
• Azure DevOps

Education
University of Virginia - 2014
Electrical Engineering and Mathematics Double Major
