Closed as not planned
Description
The NuGet.Client repo has a CI that occasionally experiences a test hang. I've gotten diagnostic logs and am not sure how to proceed.
The main test host log shows this for 12 minutes:
TpTrace Verbose: 0 : 3760, 13, 2024/10/18, 19:56:21.066, 8745895652, vstest.console.dll, TcpClientExtensions.MessageLoopAsync: Polling on remoteEndPoint: 127.0.0.1:51210 localEndPoint: 127.0.0.1:51209 after 1004 ms
...
TpTrace Verbose: 0 : 3760, 13, 2024/10/18, 20:08:39.866, 16133844229, vstest.console.dll, TcpClientExtensions.MessageLoopAsync: Polling on remoteEndPoint: 127.0.0.1:51210 localEndPoint: 127.0.0.1:51209 after 1015 ms
And one of the hosts shows this for 7minutes:
TpTrace Verbose: 0 : 4508, 9, 2024/10/18, 19:55:39.666, 8331899048, testhost.dll, TcpClientExtensions.MessageLoopAsync: Polling on remoteEndPoint: [::ffff:127.0.0.1]:51209 localEndPoint: [::ffff:127.0.0.1]:51210
...
TpTrace Verbose: 0 : 4508, 9, 2024/10/18, 20:08:40.445, 16139625412, testhost.dll, TcpClientExtensions.MessageLoopAsync: Polling on remoteEndPoint: [::ffff:127.0.0.1]:51209 localEndPoint: [::ffff:127.0.0.1]:51210
All of the other hosts show them gracefully exiting:
TpTrace Information: 0 : 10988, 1, 2024/10/18, 19:55:05.806, 7993301964, testhost.dll, Testhost process exiting.
TpTrace Information: 0 : 10988, 7, 2024/10/18, 19:55:05.806, 7993303632, testhost.dll, SocketClient.PrivateStop: Stop communication from server endpoint: 127.0.0.1:050821, error:
TpTrace Verbose: 0 : 10988, 7, 2024/10/18, 19:55:05.806, 7993305167, testhost.dll, LengthPrefixCommunicationChannel.Dispose: Dispose reader and writer.
TpTrace Verbose: 0 : 10988, 7, 2024/10/18, 19:55:05.806, 7993305526, testhost.dll, TcpClientExtensions.MessageLoopAsync: exiting MessageLoopAsync remoteEndPoint: [::ffff:127.0.0.1]:50821 localEndPoint: [::ffff:127.0.0.1]:50822
Eventually Azure DevOps times out the test run after 15 total minutes.
Logs: vstest.diag.zip
Failed build: https://dev.azure.com/dnceng-public/public/_build/results?buildId=847668&view=logs&s=feb4a670-8d1a-579f-6453-0b5a06576f62
Any advice on how to investigate this further would be greatly appreciated.
Here is our RunSettings: https://github.com/NuGet/NuGet.Client/blob/dev/build/xunit.runsettings
We're currently using .NET SDK 8.0.3xx, maybe I should try 8.0.4xx?
Metadata
Metadata
Assignees
Labels
No labels