Overview
On Site
USD 96,800.00 per year
Full Time
Skills
Servers
Device Drivers
CUDA
UVM
Linux
Machine Learning (ML)
CPU
C
Python
Scripting Language
Artificial Intelligence
Software Engineering
Data Centers
Continuous Integration
Continuous Delivery
Oracle Linux
Ubuntu
Unit Testing
GPU
BIOS
BMC
Communication
Debugging
OCI
Customer Facing
Recruiting
Health Care
Taxes
Financial Planning
Legal
Insurance
Internal Communications
IC
Integrated Circuit
Cloud Computing
Value Engineering
Innovation
Life Insurance
Accessibility
Oracle
Law
Job Details
Job Description
OCI is driving development of next-generation hyperscale GPU data centers built on Nvidia and AMD GPUs. OCI enables popular AI services such as OpenAI on GPU compute servers. We are looking for engineers experienced in working with GPU device drivers and the runtime libraries (CUDA and ROCm). You must understand GPU architectural concepts such as UVM and host-to-device and device-to-host interactions, and be able to quantify performance issues in all such interactions. We are looking for strong experience in building GPU drivers and debugging issues that occur in them and in the Linux kernels that interact with the GPU stack, including functional and performance issues when running GPU AI/ML/inference workloads. The candidate should be able to use all standard performance and stress tools, such as DCGM and the NCCL and RCCL suites. In addition, we are looking for experience debugging and diagnosing system issues reported via RAS events from the GPU BMC and other monitoring agents. The candidate should have breadth of knowledge in BIOS, CPU and GPU BMC, and must show strong proficiency in C programming and working knowledge of Python or another scripting language used in AI/GPU environments.
Responsibilities
As a member of the software engineering division, you will be required to have in-depth knowledge of Nvidia and AMD GPU architecture while working in a fast-paced development environment on projects critical to OCI's success. You must demonstrate good knowledge of GPU drivers, including building them and debugging issues related to them. You will regularly debug issues seen during new product bring-up and at data centers running customer workloads, including driving those issues to resolution with GPU vendors. All OCI engineers are expected to be on call periodically to handle OCI data center escalations. You must be comfortable with CI/CD pipelines that take vendor SW drops, build customized drivers against Oracle Linux and Ubuntu distributions, unit test functionality, and run GPU workloads to validate performance using standard benchmarks. In addition, you should have working knowledge of the entire boot process, including touch points with the BIOS and BMC subsystems. We need engineers who show strong technical and communication skills as they engage with cross-functional teams such as the HW and FW teams to debug issues and ultimately drive OCI's success.
Qualifications
Disclaimer:
Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and benefit information provided in this posting is specific to the stated locations only.
US: Hiring Range in USD from: $96,800 to $251,600 per annum. May be eligible for bonus, equity, and compensation deferral.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC5
About Us
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry leaders in almost every sector and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing or by calling +1 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.