fix(pytorch-2.10-ec2): allowlist mesa CVE-2026-40393 for training images#6271
Open
bhanutejagk wants to merge 4 commits into
Open
fix(pytorch-2.10-ec2): allowlist mesa CVE-2026-40393 for training images#6271bhanutejagk wants to merge 4 commits into
bhanutejagk wants to merge 4 commits into
Conversation
Adds an os_scan_allowlist entry for mesa CVE-2026-40393 (CVSS v3 9.8 CRITICAL, WebGPU OOB) to the PyTorch 2.10 EC2 training CPU and GPU (cu130) allowlists. mesa is pulled in transitively via libgl1-mesa-glx; training containers do not expose a WebGPU/rendering surface to untrusted content, so the vulnerable code path is not reachable. Awaiting patched mesa in Ubuntu 22.04 (upstream fix in mesa 25.3.6 / 26.0.1). Also scopes dlc_developer_config.toml to the PyTorch training EC2 buildspec only, enables sanity/security/EC2 tests, and disables SageMaker test suites for this verification PR.
added 3 commits
June 19, 2026 09:07
…files Install nest-asyncio==1.6.0 explicitly in the common stage of the CPU and GPU Dockerfiles. Restores the package previously pulled in transitively; release-baseline regression test now matches.
Add SFTY-20250331-30014 (torch <=2.12.0 torch.jit.script memory corruption) to IGNORE_SAFETY_IDS for PT 2.10 EC2 training CPU/GPU. Affected code path is not exercised in DLC training; awaiting upstream patched torch.
Add SFTY-20260511-67155 to IGNORE_SAFETY_IDS for PT EC2 training py3. The CVE is for a commit made after flash-attn 2.8.3 was released and affects upstream repo training scripts, not the installed package. Mirrors the existing entry in the ECR enhanced-scan allowlist.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Adds an os_scan_allowlist entry for mesa CVE-2026-40393 (CVSS v3 9.8 CRITICAL, WebGPU OOB) to the PyTorch 2.10 EC2 training CPU and GPU (cu130) allowlists. mesa is pulled in transitively via libgl1-mesa-glx; training containers do not expose a WebGPU/rendering surface to untrusted content, so the vulnerable code path is not reachable. Awaiting patched mesa in Ubuntu 22.04 (upstream fix in mesa 25.3.6 / 26.0.1).
Also scopes dlc_developer_config.toml to the PyTorch training EC2 buildspec only, enables sanity/security/EC2 tests, and disables SageMaker test suites for this verification PR.
Purpose
Test Plan
Test Result
Toggle if you are merging into master Branch
By default, docker image builds and tests are disabled. Two ways to run builds and tests:
How to use the helper utility for updating dlc_developer_config.toml
Assuming your remote is called
origin(you can find out more withgit remote -v)...python src/prepare_dlc_dev_environment.py -b </path/to/buildspec.yml> -cp originpython src/prepare_dlc_dev_environment.py -b </path/to/buildspec.yml> -t sanity_tests -cp originpython src/prepare_dlc_dev_environment.py -rcp originNOTE: If you are creating a PR for a new framework version, please ensure success of the local, standard, rc, and efa sagemaker tests by updating the dlc_developer_config.toml file:
sagemaker_remote_tests = truesagemaker_efa_tests = truesagemaker_rc_tests = truesagemaker_local_tests = trueHow to use PR description
Use the code block below to uncomment commands and run the PR CodeBuild jobs. There are two commands available:# /buildspec <buildspec_path># /buildspec pytorch/training/buildspec.yml# /tests <test_list># /tests sanity security ec2sanity, security, ec2, ecs, eks, sagemaker, sagemaker-local.Toggle if you are merging into main Branch
PR Checklist
pre-commit run --all-fileslocally before creating this PR. (Read DEVELOPMENT.md for details).