fix(encoding): explicitly use UTF-8 for all file I/O (#630) by oboehmer · Pull Request #798 · netascode/nac-test

oboehmer · 2026-04-27T08:28:15Z

Description

Fix UnicodeEncodeError on Windows when generating HTML reports containing Unicode characters (↕, ✓, →).

Python uses the system locale for open() by default — cp1252 on Windows, ASCII in minimal containers — which fails on the Unicode sort indicators and navigation arrows in the HTML report templates.
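
To make the failure mode concrete, here is a minimal sketch that checks whether the three characters named above can be represented in the encodings mentioned. The helper name `can_encode` is made up for illustration and is not part of nac-test.

```python
# Characters called out in the description; none of them exist in cp1252 or ASCII.
REPORT_CHARS = "↕✓→"


def can_encode(text: str, encoding: str) -> bool:
    """Return True if every character in `text` is representable in `encoding`."""
    try:
        text.encode(encoding)
    except UnicodeEncodeError:
        return False
    return True


# Hypothetical check, not taken from the nac-test code base.
results = {enc: can_encode(REPORT_CHARS, enc) for enc in ("cp1252", "ascii", "utf-8")}
print(results)
```

A text-mode `open()` without an explicit `encoding` performs the same encode step with the locale's codec, which is why the write fails only on some platforms.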

Closes

Fixes #630 (Windows: UnicodeEncodeError when generating HTML reports)

Related Issue(s)

#723: UnicodeEncodeError on Windows when stdout uses cp1252 encoding (Windows emoji in stdout; the PYTHONIOENCODING CI workaround is now removed as redundant)

Type of Change

Bug fix (non-breaking change that fixes an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Refactoring / Technical debt (internal improvements with no user-facing changes)
Documentation update
Chore (build process, CI, tooling, dependencies)
Other (please describe):

Test Framework Affected

PyATS
Robot Framework
Both
N/A (not test-framework specific)

Network as Code (NaC) Architecture Affected

Platform Tested

macOS (version tested: macOS 15)
Linux (distro/version tested: )

Key Changes

Add encoding="utf-8" to all open(), read_text(), write_text(), and NamedTemporaryFile(mode="w") calls across 16 production files
Fix the Python scripts generated inside f-strings (subprocess_auth.py, subprocess_client.py): they are written to temp files and executed via os.system(), so their own open() calls now pass encoding="utf-8" as well
Set os.environ.setdefault("PYTHONUTF8", "1") at CLI entry point to propagate UTF-8 mode to child processes
Remove redundant PYTHONIOENCODING workaround from CI workflow (#723)
Consolidate e2e UTF-8 fixtures to success + mixed scenarios with meaningful UTF-8 in data fields that flow through the full YAML → Jinja2 → report pipeline
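
The I/O changes listed above could look roughly like the following sketch. It is not the nac-test code: `write_report` and `roundtrip` are hypothetical helpers that only illustrate passing `encoding="utf-8"` to `open()`, `read_text()`, `write_text()`, and `NamedTemporaryFile(mode="w")`, plus the `setdefault` call at the entry point.

```python
import os
import tempfile
from pathlib import Path

# Entry-point step: setdefault() leaves a value the user already exported untouched.
os.environ.setdefault("PYTHONUTF8", "1")


def write_report(path: Path, html: str) -> None:
    """Hypothetical helper: write text with an explicit encoding."""
    with open(path, "w", encoding="utf-8") as fh:
        fh.write(html)


def roundtrip(text: str) -> str:
    """Push `text` through each of the call styles named in the PR."""
    with tempfile.TemporaryDirectory() as tmp:
        report = Path(tmp) / "report.html"
        write_report(report, text)

        copy = Path(tmp) / "copy.html"
        copy.write_text(report.read_text(encoding="utf-8"), encoding="utf-8")

        # delete=False so the file can be reopened by name on Windows.
        with tempfile.NamedTemporaryFile(
            mode="w", encoding="utf-8", suffix=".txt", dir=tmp, delete=False
        ) as tf:
            tf.write(copy.read_text(encoding="utf-8"))

        return Path(tf.name).read_text(encoding="utf-8")
```

With every call pinned to UTF-8, the result no longer depends on the locale the process was started under.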

Testing Done

Unit tests added/updated
Integration tests performed
Manual testing performed:
- PyATS tests executed successfully
- Robot Framework tests executed successfully
- D2D/SSH tests executed successfully (if applicable)
- HTML reports generated correctly
All existing tests pass (pytest / pre-commit run -a)

Test Commands Used

uv run pytest tests/unit/ tests/e2e/ -x -q -n auto --dist loadscope
# 1768 passed, 377 skipped

Checklist

Code follows project style guidelines (pre-commit run -a passes)
Self-review of code completed
Code is commented where necessary (especially complex logic)
Documentation updated (if applicable)
No new warnings introduced
Changes work on both macOS and Linux
CHANGELOG.md updated (if applicable)

Additional Notes

Python 3.15 (PEP 686) will make UTF-8 mode the default, making PYTHONUTF8=1 a transitional measure
The explicit encoding="utf-8" on every I/O call is the primary fix; PYTHONUTF8=1 is belt-and-suspenders for child processes spawned via os.system() and subprocess
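
As a rough illustration of the child-process point, the sketch below starts a fresh interpreter with `PYTHONUTF8=1` in its environment and reads back `sys.flags.utf8_mode`. The function name `child_utf8_mode` is an assumption made for this example; nac-test itself sets the variable once at the CLI entry point instead of per call.

```python
import os
import subprocess
import sys


def child_utf8_mode(extra_env: dict[str, str]) -> str:
    """Hypothetical helper: report sys.flags.utf8_mode as seen by a child interpreter."""
    env = {**os.environ, **extra_env}
    proc = subprocess.run(
        [sys.executable, "-c", "import sys; print(sys.flags.utf8_mode)"],
        env=env,
        capture_output=True,
        text=True,
        encoding="utf-8",
        check=True,
    )
    return proc.stdout.strip()


print(child_utf8_mode({"PYTHONUTF8": "1"}))
```

Because environment variables are inherited, anything launched later via `os.system()` or `subprocess` sees the same setting without further changes.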

All open(), read_text(), write_text(), and aiofiles.open() calls in production code now pass encoding="utf-8", removing reliance on the system locale (which can be cp1252 on Windows or ASCII in minimal containers). Also adds encoding="utf-8" to the subprocess.run(text=True) call in the e2e test harness, which caused UnicodeDecodeError when capturing nac-test's emoji-containing stdout under a non-UTF-8 locale. E2e fixtures now contain multi-byte UTF-8 characters (in comments and docstrings only, not data values) to act as regression guards for the affected file I/O paths.
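
The `subprocess.run(text=True)` part can be sketched as follows. The child command here is a stand-in that emits a UTF-8 encoded emoji on stdout (it is not the real nac-test CLI), and `run_and_capture` is a hypothetical name.

```python
import subprocess
import sys

# Stand-in for a CLI that prints emoji: the child writes raw UTF-8 bytes to stdout.
# The escapes are kept literal (raw string) so the command line itself stays ASCII.
CHILD = r"import sys; sys.stdout.buffer.write('\u2705 passed\n'.encode('utf-8'))"


def run_and_capture() -> str:
    """Capture the child's stdout, decoding it as UTF-8 rather than the locale codec."""
    proc = subprocess.run(
        [sys.executable, "-c", CHILD],
        capture_output=True,
        text=True,
        encoding="utf-8",  # without this, text=True decodes with the locale encoding
        check=True,
    )
    return proc.stdout


print(run_and_capture().strip())
```

Omitting `encoding` here is what can raise UnicodeDecodeError in the parent when the locale codec cannot decode the emoji bytes.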

…CLI entry (#630) Add encoding='utf-8' to all NamedTemporaryFile and open() calls that were missed in the initial fix: subprocess_auth.py, subprocess_client.py, device_executor.py, subprocess_runner.py, orchestrator.py. Also fix generated Python scripts inside f-strings (subprocess_auth.py, subprocess_client.py) — these are written to temp files and executed via os.system(), so their open() calls need encoding too. Set os.environ.setdefault('PYTHONUTF8', '1') at CLI entry point (nac_test/cli/main.py) to propagate UTF-8 mode to all child processes. Add targeted Unicode regression test for combined report generation.
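
The generated-script case is easy to miss, so here is a hedged sketch of the idea. `build_script` and `run_generated` are hypothetical, and the example runs the script with `subprocess.run` instead of `os.system()` to avoid shell quoting issues; the point is only that both the temp file and the `open()` call inside the generated source carry `encoding="utf-8"`.

```python
import json
import subprocess
import sys
import tempfile
from pathlib import Path


def build_script(result_path: Path, message: str) -> str:
    """Hypothetical generator: repr() embeds the values as valid Python literals."""
    return (
        "import json\n"
        f"with open({str(result_path)!r}, 'w', encoding='utf-8') as fh:\n"
        f"    json.dump({{'message': {message!r}}}, fh, ensure_ascii=False)\n"
    )


def run_generated(message: str) -> str:
    with tempfile.TemporaryDirectory() as tmp:
        result = Path(tmp) / "result.json"
        # The script source may contain non-ASCII text, so the file itself is UTF-8.
        with tempfile.NamedTemporaryFile(
            mode="w", encoding="utf-8", suffix=".py", dir=tmp, delete=False
        ) as script:
            script.write(build_script(result, message))
        subprocess.run([sys.executable, script.name], check=True)
        return json.loads(result.read_text(encoding="utf-8"))["message"]


print(run_generated("OK ✓ München"))
```

Python reads source files as UTF-8 by default, so a script written with the locale codec could fail before its own I/O is even reached.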

…os (#630) Revert decorative UTF-8 comments from 9 fixture scenarios (24 files) that only had UTF-8 in YAML comments and Python comments, content that never flows through the report pipeline. Keep and enhance success + mixed fixtures with meaningful UTF-8:
- data.yaml: UTF-8 site names (SITE_München_100, SITE_日本_100) that flow through YAML → Jinja2 → test names → HTML report
- pyATS TITLE/DESCRIPTION constants with German and Japanese text that render directly in the combined HTML report
- Robot Documentation line with non-ASCII characters


oboehmer · 2026-05-23T13:27:36Z

@aitestino, do you have an opinion on whether this is needed? Technically it is not: current nac-test handles this fine on Linux (where UTF-8 is the default encoding) as well as on Windows. Certain Windows terminals will require a UTF-8 encoding env setting to display the emojis (nothing we can do here). Please review, but we can also close it.

oboehmer added 3 commits April 26, 2026 10:37

oboehmer force-pushed the fix/630-read-write-utf8 branch from c9ec5c2 to 872bb23 Compare April 27, 2026 08:35

chore(ci): remove redundant PYTHONIOENCODING workaround (#630, #723)

b9bbf8f

The PYTHONIOENCODING env var in the Windows smoke test was a band-aid for #723 (cp1252 can't encode emoji). Now properly fixed at the source: PYTHONUTF8=1 at CLI entry point + explicit encoding='utf-8' on all I/O.

oboehmer force-pushed the fix/630-read-write-utf8 branch from 872bb23 to b9bbf8f Compare April 27, 2026 08:52

oboehmer marked this pull request as draft April 27, 2026 09:29

oboehmer marked this pull request as ready for review May 23, 2026 13:27

oboehmer requested a review from aitestino May 23, 2026 13:27

oboehmer wants to merge 4 commits into main from fix/630-read-write-utf8
