Author Topic: Optimizing ZFS settings for use with Photo Mechanic

Offline pdizzle

Optimizing ZFS settings for use with Photo Mechanic
« on: August 08, 2023, 05:02:31 PM »
I have been experimenting with ZFS recordsize and other dataset settings because opening photos in PM from my new NVMe server was absurdly slow (3+ minutes to open a folder and finish gathering sort data, vs. 45 seconds to open the same folder stored on a Windows machine on the same network).  What I found is that recordsize plays a huge role in the time it takes PM to gather sort data.

Typically, the general wisdom for ZFS datasets that store media is to use a 1M recordsize since the files are large, but what I've found on two separate systems and OSes is that 1M makes PM very slow.

I'm still testing to find the optimal recordsize, but so far recordsizes between 16K and 512K seem to perform well enough.  Anything outside that range gets very slow when gathering sort data.  I will do some more testing and see which recordsize gives the lowest load time on average.
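For anyone who wants to reproduce the comparison, the setup is roughly the following (pool and dataset names are placeholders; each dataset gets a copy of the same test folder before timing PM's contact sheet load):

    # Create one test dataset per candidate recordsize, then copy the same
    # folder of RAW files into each before timing how long PM takes to
    # open the contact sheet and finish gathering sort data.
    for rs in 16K 32K 64K 128K 256K 512K 1M; do
        zfs create -o recordsize=$rs tank/pm-test-$rs
        cp -r /tank/photos/testfolder /tank/pm-test-$rs/
    done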

I also have compression off on this dataset because the data isn't compressible to begin with, so there's no need to waste CPU on it.

Another thing I'm going to test once I've identified the best recordsize is primarycache=metadata for this dataset, since it's on NVMe drives.  It may be faster to read file data straight from NVMe rather than caching it in ARC, but I'm not sure whether this workload is one of those cases.  See the sketch below.
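For reference, the two property changes in question look like this (dataset name is a placeholder):

    # Both settings only affect activity from this point on: compression
    # applies to newly written blocks, and primarycache=metadata stops
    # caching file data (but still caches metadata) in ARC going forward.
    zfs set compression=off tank/photos
    zfs set primarycache=metadata tank/photos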

A question I'm hoping a PM dev will pop in and answer: how much of each RAW file does PM read when gathering sort data?  And when going through the preview window, how much of each file does PM read?  That could help inform the recordsize choice.

Offline Kirk Baker

  • Senior Software Engineer
  • Camera Bits Staff
Re: Optimizing ZFS settings for use with Photo Mechanic
« Reply #1 on: August 08, 2023, 05:07:12 PM »
Quote from: pdizzle on August 08, 2023, 05:02:31 PM

A question I'm hoping a PM dev will pop in and answer: how much of each RAW file does PM read when gathering sort data?  And when going through the preview window, how much of each file does PM read?  That could help inform the recordsize choice.

Metadata access uses small reads and seeks through the EXIF data and makernote data for the most part.  Preview rendering reads the embedded JPEG data in the RAW file.  The great majority of the RAW file is not read at all.
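(For what it's worth, on recent OpenZFS you can watch the pool's request-size histogram while PM gathers sort data to see this pattern directly; the pool name below is just an example:)

    # Print a request-size histogram for the pool every 5 seconds while
    # PM is gathering sort data; the small metadata reads show up in the
    # low-size buckets.
    zpool iostat -r tank 5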

HTH,

-Kirk

Offline pdizzle

Re: Optimizing ZFS settings for use with Photo Mechanic
« Reply #2 on: August 08, 2023, 05:46:25 PM »

Quote from: Kirk Baker on August 08, 2023, 05:07:12 PM

Metadata access uses small reads and seeks through the EXIF data and makernote data for the most part.  Preview rendering reads the embedded JPEG data in the RAW file.  The great majority of the RAW file is not read at all.


Hi Kirk,
Do you know roughly how many bytes get read on average during the metadata access / sort data gathering phase?  Is there a way to find out the size of the embedded JPEG?

Thanks in advance!

Offline pdizzle

Re: Optimizing ZFS settings for use with Photo Mechanic
« Reply #3 on: August 10, 2023, 10:47:49 AM »
In my tests, recordsizes from 16K to 512K all performed very closely.  I'm going to go with a 512K recordsize and see how that goes.
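For what it's worth, switching an existing dataset looks like this (dataset name is a placeholder); keep in mind that a new recordsize only applies to data written after the change, so existing files need to be re-copied or rewritten to pick it up:

    # Apply the new recordsize and confirm it; only blocks written from
    # now on will use 512K records, existing files keep their old record size.
    zfs set recordsize=512K tank/photos
    zfs get recordsize tank/photos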

Offline Kirk Baker

  • Senior Software Engineer
  • Camera Bits Staff
Re: Optimizing ZFS settings for use with Photo Mechanic
« Reply #4 on: August 10, 2023, 11:09:41 AM »

Quote from: pdizzle on August 08, 2023, 05:46:25 PM

Do you know roughly how many bytes get read on average during the metadata access / sort data gathering phase?  Is there a way to find out the size of the embedded JPEG?

The size of the data read is usually less than 64KB, but it varies depending on the file format and whether the file has been modified since it was shot.
The size of the embedded JPEG preview can be parsed out of the file (PM does this).  It also varies, but is usually less than a few MB.
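(For checking this outside of PM: assuming exiftool is available, something like the following extracts the embedded preview and reports its size in bytes; the exact preview tag name varies by camera format, and the filename is just an example:)

    # Extract the embedded preview JPEG and count its bytes; swap
    # -PreviewImage for -JpgFromRaw on formats that store the large
    # preview under that tag.
    exiftool -b -PreviewImage IMG_0001.CR2 | wc -c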

-Kirk

Offline pdizzle

Re: Optimizing ZFS settings for use with Photo Mechanic
« Reply #5 on: August 11, 2023, 11:29:05 AM »
Based on 8 runs of opening a contact sheet and waiting until it finished gathering sort data, the top three recordsizes were 512K, 16K, and 32K.  256K and 64K were close behind, and 128K came in last by almost a second.