drbd: add back some fairness to AL transactions

When batching more updates to the activity log into single transactions, we lost the ability for new requests to force themselves into the active set: all preparation steps became non-blocking, and if all currently hot extents keep busy, they could starve out new incoming requests to cold extents for quite a while. This can only happen if your IO backend accepts more IO operations per average DRBD replication round trip time than you have al-extents configured. If we have incoming requests to cold extents, at least do one blocking update per transaction. In an artificial worst-case workload on SSD with an asynchronous 600 ms replication link, with al-extents = 7 (the minimum we allow), and concurrent full resynch, without this patch, some write requests have been observed to be starved for 40 seconds. With this patch, application observed a worst case latency of twice the replication round trip time. Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com> Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com> Signed-off-by: Jens Axboe <axboe@fb.com>
author: Lars Ellenberg <lars@linbit.com> 2014-04-28 18:43:28 +0200
committer: Jens Axboe <axboe@fb.com> 2014-04-30 13:46:55 -0600
commit: e4d7d6f4d36daff6aad84f96e48debde8e6ed09e (patch)
tree: de11d55c2590f5b1b279807c81c828ec102e10af /drivers/block/drbd/drbd_req.c
parent: fa090e708a72f0ea9cbe067fba28cfb2b8b787af (diff)
download: linux-e4d7d6f4d36daff6aad84f96e48debde8e6ed09e.tar.gz
linux-e4d7d6f4d36daff6aad84f96e48debde8e6ed09e.tar.xz
1 files changed, 20 insertions, 0 deletions
diff --git a/drivers/block/drbd/drbd_req.c b/drivers/block/drbd/drbd_req.c
index 7aadd0906b01..99411bf9d901 100644
--- a/drivers/block/drbd/drbd_req.c
+++ b/drivers/block/drbd/drbd_req.c
@@ -1242,6 +1242,7 @@ void do_submit(struct work_struct *ws)
 		if (list_empty(&incoming))
 			break;
 
+skip_fast_path:
 		wait_event(device->al_wait, prepare_al_transaction_nonblock(device, &incoming, &pending));
 		/* Maybe more was queued, while we prepared the transaction?
 		 * Try to stuff them into this transaction as well.
@@ -1280,6 +1281,25 @@ void do_submit(struct work_struct *ws)
 			list_del_init(&req->tl_requests);
 			drbd_send_and_submit(device, req);
 		}
+
+		/* If all currently hot activity log extents are kept busy by
+		 * incoming requests, we still must not totally starve new
+		 * requests to cold extents. In that case, prepare one request
+		 * in blocking mode. */
+		list_for_each_entry_safe(req, tmp, &incoming, tl_requests) {
+			list_del_init(&req->tl_requests);
+			req->rq_state |= RQ_IN_ACT_LOG;
+			if (!drbd_al_begin_io_prepare(device, &req->i)) {
+				/* Corresponding extent was hot after all? */
+				drbd_send_and_submit(device, req);
+			} else {
+				/* Found a request to a cold extent.
+				 * Put on "pending" list,
+				 * and try to cumulate with more. */
+				list_add(&req->tl_requests, &pending);
+				goto skip_fast_path;
+			}
+		}
 	}
 }
author	Lars Ellenberg <lars@linbit.com>	2014-04-28 18:43:28 +0200
committer	Jens Axboe <axboe@fb.com>	2014-04-30 13:46:55 -0600
commit	e4d7d6f4d36daff6aad84f96e48debde8e6ed09e (patch)
tree	de11d55c2590f5b1b279807c81c828ec102e10af /drivers/block/drbd/drbd_req.c
parent	fa090e708a72f0ea9cbe067fba28cfb2b8b787af (diff)
download	linux-e4d7d6f4d36daff6aad84f96e48debde8e6ed09e.tar.gz linux-e4d7d6f4d36daff6aad84f96e48debde8e6ed09e.tar.xz