Handling-multi-byte-unicode-characters by teg-atlassian · Pull Request #1650 · atlassian/atlascode

teg-atlassian · 2026-02-24T07:11:26Z

What Is This Change?

In the process of the backend RovoDev sending a response message and Atlascode receiving the message, the following transformations occur

HTTP delivers raw bytes in arbitrary-sized chunks
The SSE (Server Side Event) parser splits on \n\n - but this split happens at the byte level, not character level
If \n\n falls in the middle of a multi-byte UTF-8 character, the split corrupts the data

This sometimes split tokens such as ****tool_name**** as ***t' and ool_name**** and this results in Atlascode throwing error and the session failing.

A more detailed (a little bit long) discussion about the problem and the solution can be found here.
With this PR, we

safely parse the the string: we don't assume the string is json
when the parsing fails, we attempt again by combining different consecutive chunks. So, the trick is just try different combinations.

How Has This Been Tested?

Basic checks:

npm run lint
npm run test
new tests

Advanced checks:

If Atlassian employee & Bitbucket changes: did you test with DC in mind? See Instructions

Recommendations:

Update the CHANGELOG if making a user facing change

Rovo Dev code review: Rovo Dev couldn't review this pull request
Upgrade to Rovo Dev Standard to continue using code review.

marcomura · 2026-02-24T20:22:26Z

src/rovo-dev/client/responseParser.ts

+                } catch {
+                    // JSON parse failed - likely due to incomplete multi-byte UTF-8 character at chunk boundary
+                    // Put this chunk back in the buffer and wait for more data
+                    this.buffer = chunkRaw + '\n\n' + this.buffer;


match[2] is the end of the chunkRaw.
If match[2] can't be parsed, adding it back to the buffer followed by '\n\n' will keep it broken.

Let's discuss about this

this code is wrapped in another loop, . So, as you said match[2] is the last token for this buffer but we get new data since we are in the loop.

marcomura

Based on how Rovo Dev responds, this change should not be necessary.
Let's discuss it.

bwieger-atlassian-com · 2026-02-24T21:31:45Z

My thought here is that this should be solved at the Rovo Dev Server layer, not at the client level... but that's just a first impression. Worth chatting with Tim Esler on this.

marcomura · 2026-02-24T23:33:38Z

Agree with @bwieger-atlassian-com that any issue in the response should be fixed at Rovo Dev level.

However, looking at the telemetry, I don't believe the response is split incorrectly, but I think the tool-response may be responding with a different format than what we are expecting (e.g., string instead of json).

ensuring "invalid" jsons are retried again to see if they are valid

cfe948f

teg-atlassian requested review from BHulovatyi, Blastoplex, amarg-at, bwieger-atlassian-com, cabella-dot, ccallcottstevens, cindy-atl, dchiew-atl, jwang19-atlassian, marcomura, matt-lassian, mattcolman, om-ukr, sdzh-atlassian and sky-ocean-spirits as code owners February 24, 2026 07:11

teg-atlassian changed the title ~~ensuring "invalid" jsons are retried again to see if they are valid~~ Handling-multi-byte-unicode-characters Feb 24, 2026

marcomura reviewed Feb 24, 2026

View reviewed changes

marcomura requested changes Feb 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handling-multi-byte-unicode-characters#1650

Handling-multi-byte-unicode-characters#1650
teg-atlassian wants to merge 1 commit intomainfrom
Handling-multi-byte-unicode-characters

teg-atlassian commented Feb 24, 2026 •

edited

Loading

Uh oh!

marcomura Feb 24, 2026

Uh oh!

teg-atlassian Feb 24, 2026

Uh oh!

marcomura left a comment

Uh oh!

bwieger-atlassian-com commented Feb 24, 2026

Uh oh!

marcomura commented Feb 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

teg-atlassian commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What Is This Change?

How Has This Been Tested?

Uh oh!

marcomura Feb 24, 2026

Choose a reason for hiding this comment

Uh oh!

teg-atlassian Feb 24, 2026

Choose a reason for hiding this comment

Uh oh!

marcomura left a comment

Choose a reason for hiding this comment

Uh oh!

bwieger-atlassian-com commented Feb 24, 2026

Uh oh!

marcomura commented Feb 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

teg-atlassian commented Feb 24, 2026 •

edited

Loading