On Thu, 2008-08-07 at 20:02 +0200, Andi Kleen wrote:Most metadata is allocated in groups of 128k or 256k, and so most of the writes are nicely sized. The mirroring code has areas of the disk dedicated to mirror other areas. So we end up with something like this: metadata chunk A (~1GB in size) [ ......................... ] mirror of chunk A (~1GB in size) [ ......................... ] So, the mirroring turns a single large write into two large writes. Definitely not free, but always a fixed cost. I started to make some numbers of this yesterday on single spindles and discovered that my worker threads are not doing as good a job as they should be of maintaining IO ordering. I've been using an array with a writeback cache for benchmarking lately and hadn't noticed. I need to fix that, but here are some numbers on a single sata drive. The drive can do about 100MB/s streaming reads/writes. Btrfs checksumming and inline data (tail packing) are both turned on. Single process creating 30 kernel trees (2.6.27-rc2) Btrfs defaults 36MB/s Btrfs no mirror 50MB/s Ext4 defaults 59.2MB/s (much better than ext3 here) With /sys/block/sdb/queue/nr_requests at 8192 to hide my IO ordering submission problems: Btrfs defaults: 57MB/s Btrfs no mirror: 61.51MB/s -chris --
| Tarkan Erimer | Re: Dual-Licensing Linux Kernel with GPL V2 and GPL V3 |
| Greg KH | [GIT PATCH] driver core patches against 2.6.24 |
| Michal Piotrowski | Re: Linux 2.6.21-rc4 |
| Joe Peterson | Re: 2.6.25.3: su gets stuck for root |
git: | |
| Jarek Poplawski | [PATCH] pkt_sched: Destroy gen estimators under rtnl_lock(). |
| David Miller | [GIT]: Networking |
| Gerrit Renker | [PATCH 15/37] dccp: Set per-connection CCIDs via socket options |
| Emil S Tantilov | Re: WARNING: at include/net/sock.h:417 udp_lib_unhash |
