Test run hangs #10394

Closed as not planned
@jeffkl

Description

The NuGet.Client repo has a CI pipeline that occasionally hits a test hang. I've collected diagnostic logs but am not sure how to proceed.

The main log (vstest.console) repeats this polling message for about 12 minutes:

TpTrace Verbose: 0 : 3760, 13, 2024/10/18, 19:56:21.066, 8745895652, vstest.console.dll, TcpClientExtensions.MessageLoopAsync: Polling on remoteEndPoint: 127.0.0.1:51210 localEndPoint: 127.0.0.1:51209 after 1004 ms
...
TpTrace Verbose: 0 : 3760, 13, 2024/10/18, 20:08:39.866, 16133844229, vstest.console.dll, TcpClientExtensions.MessageLoopAsync: Polling on remoteEndPoint: 127.0.0.1:51210 localEndPoint: 127.0.0.1:51209 after 1015 ms

And one of the test hosts shows this for 7 minutes:

TpTrace Verbose: 0 : 4508, 9, 2024/10/18, 19:55:39.666, 8331899048, testhost.dll, TcpClientExtensions.MessageLoopAsync: Polling on remoteEndPoint: [::ffff:127.0.0.1]:51209 localEndPoint: [::ffff:127.0.0.1]:51210
...
TpTrace Verbose: 0 : 4508, 9, 2024/10/18, 20:08:40.445, 16139625412, testhost.dll, TcpClientExtensions.MessageLoopAsync: Polling on remoteEndPoint: [::ffff:127.0.0.1]:51209 localEndPoint: [::ffff:127.0.0.1]:51210

All of the other hosts exit gracefully:

TpTrace Information: 0 : 10988, 1, 2024/10/18, 19:55:05.806, 7993301964, testhost.dll, Testhost process exiting.
TpTrace Information: 0 : 10988, 7, 2024/10/18, 19:55:05.806, 7993303632, testhost.dll, SocketClient.PrivateStop: Stop communication from server endpoint: 127.0.0.1:050821, error:
TpTrace Verbose: 0 : 10988, 7, 2024/10/18, 19:55:05.806, 7993305167, testhost.dll, LengthPrefixCommunicationChannel.Dispose: Dispose reader and writer.
TpTrace Verbose: 0 : 10988, 7, 2024/10/18, 19:55:05.806, 7993305526, testhost.dll, TcpClientExtensions.MessageLoopAsync: exiting MessageLoopAsync remoteEndPoint: [::ffff:127.0.0.1]:50821 localEndPoint: [::ffff:127.0.0.1]:50822

Eventually Azure DevOps times out the test run after 15 total minutes.
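
To spot the stuck process across a large set of diag logs, it can help to measure how long each process kept emitting the `MessageLoopAsync: Polling on` message. Below is a minimal Python sketch, not part of the vstest tooling. It assumes the comma-separated fields in each `TpTrace` line are process ID, thread ID, date, time, a tick counter, and the source module, which is inferred from the excerpts above rather than from a documented format. The function name `polling_spans` is hypothetical.

```python
import re
from datetime import datetime

# Assumed layout, inferred from the excerpts in this issue:
# TpTrace <Level>: 0 : <pid>, <tid>, <date>, <time>, <ticks>, <source>, <message>
TRACE_RE = re.compile(
    r"^TpTrace (?P<level>\w+): \d+ : (?P<pid>\d+), (?P<tid>\d+), "
    r"(?P<date>\d{4}/\d{2}/\d{2}), (?P<time>\d{2}:\d{2}:\d{2}\.\d{3}), "
    r"\d+, (?P<source>[^,]+), (?P<message>.*)$"
)

def polling_spans(lines):
    """Return {(source, pid): timedelta} between the first and last polling message."""
    spans = {}
    for line in lines:
        m = TRACE_RE.match(line.strip())
        if m is None or "MessageLoopAsync: Polling on" not in m["message"]:
            continue  # skip non-trace lines and non-polling messages
        ts = datetime.strptime(f"{m['date']} {m['time']}", "%Y/%m/%d %H:%M:%S.%f")
        key = (m["source"], int(m["pid"]))
        first, last = spans.get(key, (ts, ts))
        spans[key] = (min(first, ts), max(last, ts))
    return {key: last - first for key, (first, last) in spans.items()}
```

Fed only the first and last polling lines quoted above, this reports about 738.8 seconds for vstest.console.dll (PID 3760) and about 780.8 seconds for testhost.dll (PID 4508). Processes that exited gracefully have no long polling span, so the hung pair stands out.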

Logs: vstest.diag.zip

Failed build: https://dev.azure.com/dnceng-public/public/_build/results?buildId=847668&view=logs&s=feb4a670-8d1a-579f-6453-0b5a06576f62

Any advice on how to investigate this further would be greatly appreciated.

Here is our RunSettings: https://github.com/NuGet/NuGet.Client/blob/dev/build/xunit.runsettings

We're currently using .NET SDK 8.0.3xx. Should I try 8.0.4xx?
