[Ask For Help] metadata operations works but failed to write data into storage servers. #239

Open
SimonCqk opened this issue Apr 13, 2025 · 2 comments

@SimonCqk
Contributor

I have deployed 3FS in a production Kubernetes environment (RoCE RDMA container network). Components such as Mgmtd and Meta appear normal, and the list-nodes command returns the expected results. After mounting FUSE, metadata operations such as directory listing work correctly, but data writes (for example, echo hello > /xxxx/hello.txt) fail with the errors below. RDMA network connectivity between the FUSE clients and the Storage nodes has been verified.
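For anyone trying to reproduce this, the write-then-read check described above can be sketched in a few lines of Python. This is only an illustration using the standard library, not part of 3FS: the function name `probe_write_read` is hypothetical, and the mount directory passed in is assumed to be wherever the FUSE client is mounted (the `/xxxx` placeholder from the report is kept as-is).

```python
import errno
import os
import sys


def probe_write_read(mount_dir: str, name: str = "hello.txt",
                     payload: bytes = b"hello\n") -> dict:
    """Write a small file, read it back, and report which step failed (if any)."""
    path = os.path.join(mount_dir, name)
    result = {"path": path, "write": "ok", "read": "ok"}

    # Step 1: write and force the data out, so a storage-side failure
    # surfaces here instead of being deferred to a later close().
    try:
        with open(path, "wb") as f:
            f.write(payload)
            f.flush()
            os.fsync(f.fileno())
    except OSError as e:
        result["write"] = errno.errorcode.get(e.errno, str(e.errno))
        result["read"] = "skipped"
        return result

    # Step 2: read the file back and compare with what was written.
    try:
        with open(path, "rb") as f:
            data = f.read()
        if data != payload:
            result["read"] = "mismatch"
    except OSError as e:
        result["read"] = errno.errorcode.get(e.errno, str(e.errno))
    return result


if __name__ == "__main__":
    # The mount point is an assumption; pass the real one as the first argument.
    print(probe_write_read(sys.argv[1] if len(sys.argv) > 1 else "/xxxx"))
```

Separating the write step from the read step makes it easier to say which operation the client actually fails on, which helps when matching the result against the client log.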

list-nodes output

Image

stderr

[2025-04-13T07:18:58.223072746+00:00 hf3fs_fuse_main:235453 StorageClientImpl.cc:1173 ERROR] Give up retrying 1/1 ops 0x7f1fbc792758 after #3 retries, elapsed time: 60001ms, max retry time: 60000ms, user: Uid(0)/Gid(0), usercall: #1
[2025-04-13T07:18:58.223156990+00:00 hf3fs_fuse_main:235453 StorageClientImpl.cc:1637 ERROR] #1/1 Failed batchRead operation 0x7f1fbc7641f0 with id RequestId(0), routing target: ChainId(900100014)@ChainVersion(2)@RoutingInfoVersion(18):TargetId(0)@NodeId(0):ChannelId(0)#ChannelSeqNum(0), chunk range: ChunkId(00000000-00000003-E0040000-00000000), user: Uid(0)/Gid(0), usercall: #1, status: StorageClient::NotAvailable(7005)
[2025-04-13T07:18:58.223172010+00:00 hf3fs_fuse_main:235453 StorageClientImpl.cc:1637 ERROR] Finally collected 1/1 failed batchRead operations 0x7f1fbc77d768 issued by user Uid(0)/Gid(0), usercall: #1
[2025-04-13T07:18:58.223277021+00:00 hf3fs_fuse_main:235453 FuseOps.cc:1544 ERROR]   hf3fs_read error, ino 0x000000000003e004, size 4096, off 0, err -7005
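The failed-operation line above packs the routing target and status into one long string. As a reading aid, here is a small Python sketch that pulls the `Name(value)` fields and the trailing status out of such a line. The regular expressions are assumptions inferred from the log lines in this report, not a documented 3FS log format, and `parse_failed_op` is a hypothetical helper name.

```python
import re

# Matches tokens such as ChainId(900100014) or ChunkId(00000000-00000003-...).
FIELD = re.compile(r"\b([A-Za-z]+)\(([0-9A-Fa-f-]+)\)")
# Matches the trailing status, e.g. StorageClient::NotAvailable(7005).
STATUS = re.compile(r"^([\w:]+)\((\d+)\)")

SAMPLE = (
    "[2025-04-13T07:18:58.223156990+00:00 hf3fs_fuse_main:235453 "
    "StorageClientImpl.cc:1637 ERROR] #1/1 Failed batchRead operation "
    "0x7f1fbc7641f0 with id RequestId(0), routing target: "
    "ChainId(900100014)@ChainVersion(2)@RoutingInfoVersion(18):"
    "TargetId(0)@NodeId(0):ChannelId(0)#ChannelSeqNum(0), chunk range: "
    "ChunkId(00000000-00000003-E0040000-00000000), user: Uid(0)/Gid(0), "
    "usercall: #1, status: StorageClient::NotAvailable(7005)"
)


def parse_failed_op(line: str):
    """Return (fields, status) parsed from one failed-operation log line."""
    # Split off the status so its code is not mistaken for a routing field.
    head, _, tail = line.partition(", status: ")
    fields = {}
    for name, value in FIELD.findall(head):
        fields.setdefault(name, value)  # keep the first occurrence of each name
    m = STATUS.match(tail)
    status = (m.group(1), int(m.group(2))) if m else None
    return fields, status


if __name__ == "__main__":
    fields, status = parse_failed_op(SAMPLE)
    for key in ("ChainId", "ChainVersion", "RoutingInfoVersion", "TargetId", "NodeId"):
        print(key, "=", fields.get(key))
    print("status =", status)
```

In the line posted here, both TargetId and NodeId are 0 while the chain ID and versions are populated, so it may be worth comparing these values against the targets and nodes the cluster actually reports.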

stdout

Image

SimonCqk changed the title from "[AFH] metadata operations works but failed to write data into storage servers." to "[Ask For Help] metadata operations works but failed to write data into storage servers." on Apr 13, 2025
@SimonCqk
Contributor Author

SimonCqk commented Apr 13, 2025

New stderr from the storage-server side. It seems that target creation did not complete as expected?

Image

Image

@Navacore

Navacore commented Apr 20, 2025 via email
