On Fri, 2008-08-15 at 15:59 -0400, Theodore Tso wrote:
It sounds like ext4 would show the writeback_index bug with
fragmentation on disk and btrfs would show it with seeks during the
benchmark. I was only watching the throughput numbers and not looking
at filefrag results.
pdflush and delalloc and raid stripe alignment and lots of other things
don't play well together. In general, I think we need one or more
pdflush threads per mounted FS so that write_cache_pages doesn't have to
bail out every time it hits congestion.
The current write_cache_pages code even misses easy changes to create
bigger bios just because a block device is congested when called by
background_writeout()
But I would hope we can deal with a single threaded small file workload
like compilebench without resorting to big rewrites
Looks like everyone who walks sb->s_io or s_dirty walks it backwards.
This should make the newly dirtied inode the first one to be processed,
which probably isn't what we want. I could be reading it backwards of
course ;)
-chris
--