`stat` call for `dir1/dir2/dir3/file` triggers many List and Head requests #770

mhnap · 2024-02-21T21:34:05Z

Mountpoint for Amazon S3 version

mount-s3 1.4.0

AWS Region

us-east-2

Describe the running environment

Running in local Ubuntu 22.04.

Mountpoint options

mount-s3 mhnap-bucket /media/mhnap/mnt --read-only --log-directory=/tmp/mount-s3/logs --debug --debug-crt --log-metrics

What happened?

I have created dir1/dir2/dir3/ directory hierarchy with one file inside in the local system and uploaded dir1 to my newly created mhnap-bucket.

mhnap@hp:~/projects/playground$ aws s3 ls --recursive s3://mhnap-bucket
2024-02-21 22:59:13         10 dir1/dir2/dir3/file

After, I mounted this bucket using the mount-s3 command and called stat (using the same code from stat doc) for /media/mhnap/mnt/dir1/dir2/dir3/file path 10 times.

My concern is that I see also List and Head requests for dir1/, dir1/dir2/, dir1/dir2/dir3/, and not only one Head request for dir1/dir2/dir3/file as I would expect. In CloudWatch I see a total of 72 List and 45 Head requests for the whole test duration.

I may be missing something and it indeed can be correct behavior. In such a case, I would be grateful to find an explanation for such behavior.

Cannot paste logs, so uploaded them here: mountpoint-s3-2024-02-21T21-07-50Z.log

Relevant log output

No response

The text was updated successfully, but these errors were encountered:

jamesbornholt · 2024-02-22T01:18:25Z

Yeah, this is unfortunate but it's the expected behavior. It's common to all Linux file systems, which never get to see the entire path in one shot. Instead, they're always accessed one directory at a time: to access dir1/dir2/dir3/file, Linux first has to check that dir1 exists and is a directory, then dir2 exists and is a directory inside dir1, etc.

You can use metadata caching to work around the cost of these repeated lookups, although with the caveats you've mentioned in #768 and #759.

If you know you'll only be accessing a subdirectory of your bucket rather than the whole thing, you can also use the --prefix argument to mount just that subdirectory, which will remove the need to do some of these recursive lookups.

mhnap added the bug Something isn't working label Feb 21, 2024

jamesbornholt closed this as completed Feb 22, 2024

mhnap mentioned this issue Feb 22, 2024

stat call for path triggers two requests (one List and one Head) with enabled cache #777

Closed

passaro added question Further information is requested and removed bug Something isn't working labels Feb 26, 2024

fredDJSonos mentioned this issue Jul 10, 2024

Bazillion of ListBucket issued #938

Open

vladem mentioned this issue Dec 16, 2024

Reading a file creates a Bucket GET request #1198

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`stat` call for `dir1/dir2/dir3/file` triggers many List and Head requests #770

`stat` call for `dir1/dir2/dir3/file` triggers many List and Head requests #770

mhnap commented Feb 21, 2024

jamesbornholt commented Feb 22, 2024

stat call for dir1/dir2/dir3/file triggers many List and Head requests #770

stat call for dir1/dir2/dir3/file triggers many List and Head requests #770

Comments

mhnap commented Feb 21, 2024

Mountpoint for Amazon S3 version

AWS Region

Describe the running environment

Mountpoint options

What happened?

Relevant log output

jamesbornholt commented Feb 22, 2024

`stat` call for `dir1/dir2/dir3/file` triggers many List and Head requests #770

`stat` call for `dir1/dir2/dir3/file` triggers many List and Head requests #770