Conversation

@YWHyuk YWHyuk commented Apr 28, 2022

CC-lock is based on the flat-combining lock algorithm.
In this lock, only one thread, called the combiner
thread, executes the requests of the critical sections.
The combiner thread can therefore exploit locality and
avoid high contention on the lock variable.

When each CPU uses only one node, assume the lock
holds node A. In this case, node A's
(wait, completed) status should be (false, false).

Lock
 |
 A
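The (wait, completed) pair can be modeled with a minimal struct. This is an illustrative sketch only: the field names mirror the status bits described above, but the struct layout and the `holds_lock` helper are assumptions, not the actual patch code.

```c
#include <stdbool.h>

/* Illustrative sketch of a per-CPU combining node with the
 * (wait, completed) status bits described above. */
struct cc_node {
    bool wait;       /* true while this CPU spins, waiting on the combiner */
    bool completed;  /* true once the combiner has executed the request    */
};

/* The node currently holding the lock is neither waiting nor finished,
 * i.e. its status is (false, false). */
static bool holds_lock(const struct cc_node *n)
{
    return !n->wait && !n->completed;
}
```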

When CPUs A and B race, assume that B wins.
Then B will try to spin on A's wait status.

A   ->   B
w:F      w:T

At the same time, A was enqueued again, so A's wait
status was set to true, as below.

A   ->   B   ->   A
w:T      w:T      w:T

This leads to a deadlock.

To avoid this node-reuse problem, each CPU has two
cc_nodes, which are used alternately.

A_0 ->   B_0 ->   A_1
w:F      w:T      w:T
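The alternation can be sketched as a two-element node array with a toggling index per CPU. The `percpu_cc` and `pick_node` names are hypothetical; the real patch may arrange this differently, but the effect is the same: a node that is still queued is never handed out a second time.

```c
#include <stdbool.h>

struct cc_node {
    bool wait;
    bool completed;
};

/* Hypothetical per-CPU state: two cc_nodes used alternately, so a node
 * whose previous enqueue is still outstanding (wait == true) is never
 * re-enqueued. */
struct percpu_cc {
    struct cc_node node[2];
    int idx;                 /* index of the node to use next */
};

static struct cc_node *pick_node(struct percpu_cc *p)
{
    struct cc_node *n = &p->node[p->idx];
    p->idx ^= 1;             /* alternate 0, 1, 0, 1, ... */
    return n;
}
```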

Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

YWHyuk added 10 commits April 28, 2022 19:24
Testing reported that there is a deadlock. The
situation is below.

Node(0, 1) {
	req = 00000000d0495726,
	params = 000000002f36f5ac,
	wait = 0, completed = 1,
	refcount = 0,
	Next (2, 0)
	Prev (0, 0)
}

Node(2, 0) {
	req = 00000000d0495726,
	params = 000000002f36f5ac,
	wait = 1, completed = 0,
	refcount = 0,
	Next (2, 1)
	Prev (0, 1)
}

Node(0, 1)'s request has been handled, so its (wait,
completed) status is (0, 1). But its next node,
Node(2, 0), still has wait = 1. The combiner thread
should set Node(2, 0)'s wait = 0. The previous logic
set wait = 0 when DECODE_CPU(pending_cpu) != NR_CPUS.

But there can be a race between the combiner thread
and a normal thread. The combiner thread checks
node->req first and then checks node->next, so the
situation below can occur:

Node(0, 1)			Node(2, 0)
				prev->req = req
if(pending->req)
...
DECODE_CPU(pending->next)
				prev->next = this_cpu

To fix this, the combiner thread now checks node->next first.
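A sketch of the corrected check order, with invented names (`combiner_handover` is not from the patch): because the enqueuer publishes prev->req first and prev->next second, loading next before looking at req means the combiner never acts on a half-published successor.

```c
#include <stdbool.h>
#include <stddef.h>

/* Illustrative node layout; the enqueuing CPU writes ->req first and
 * ->next second, as in the interleaving shown above. */
struct cc_node {
    void *req;               /* non-NULL once a request is posted  */
    struct cc_node *next;    /* written after req by the enqueuer  */
    bool wait;
};

/* Fixed order: read next FIRST. If next is not yet visible, bail out
 * and retry later instead of concluding from req alone. */
static struct cc_node *combiner_handover(struct cc_node *pending)
{
    struct cc_node *next = pending->next;
    if (next == NULL || pending->req == NULL)
        return NULL;         /* publication incomplete */
    next->wait = false;      /* let the successor stop spinning */
    return next;
}
```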

Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>
Previously, the test thread used jiffies to measure the
elapsed time. But its resolution is low, so all the
results were zero or one. Use sched_clock() instead.

Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>
To keep the ordering between reading node->next and
writing node->wait and node->completed, a full barrier,
smp_mb(), should be used instead of a weaker barrier.
So fix it.
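In C11 terms (an analogue only; the kernel uses its own smp_* barriers, and `finish_and_handover` is an invented name), the combiner's load of node->next must not be reordered past its later stores to wait/completed. Ordering a load against later stores requires a full fence; a store-store (release-style) fence would not constrain the load.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

struct cc_node {
    struct cc_node *_Atomic next;
    atomic_bool wait;
    atomic_bool completed;
};

/* The relaxed load of ->next must be ordered before the stores below.
 * atomic_thread_fence(memory_order_seq_cst) plays the role of smp_mb();
 * a write-only barrier would leave the load free to move down. */
static struct cc_node *finish_and_handover(struct cc_node *node)
{
    struct cc_node *next = atomic_load_explicit(&node->next,
                                                memory_order_relaxed);
    atomic_thread_fence(memory_order_seq_cst);   /* smp_mb() analogue */
    atomic_store_explicit(&node->completed, true, memory_order_relaxed);
    if (next)
        atomic_store_explicit(&next->wait, false, memory_order_relaxed);
    return next;
}
```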

Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>
Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>
Using "echo 2 > trigger", the spinlock-based
benchmark can be run.

Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>
Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>
Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>
Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>