Using list_for_each_entry() in place of list_for_each_entry

Tetsuo Handa

unread,

Nov 1, 2014, 8:00:06 AM11/1/14

to

Excuse me for FAQ, Paul.
I want to confirm one thing for code optimization in LSM stacking.
( https://marc.info/?l=linux-security-module&m=141481716931982&w=2 )

In the following code, is there race window for seeing invalid
"struct list_head"->next value if we used list_for_each_entry()
in place of list_for_each_entry_rcu() ?

----------
/* Definition and declaration */
DEFINE_SPINLOCK(my_lock);
LIST_HEAD(my_list);
struct my_struct {
struct list_head list;
const unsigned long value;
} v1 = { .value = 1 }, v2 = { .value = 2 }, v3 = { .value = 3 };

/* Writer side */
void add_entry(struct my_struct *p) {
spin_lock(&my_lock);
list_add_tail_rcu(&p->list, &my_list);
spin_unlock(&my_lock);
}

void del_entry(struct my_struct *p) {
spin_lock(&my_lock);
list_del_rcu(&p->list);
spin_unlock(&my_lock);
}

/* Reader side */
unsigned long reader(void) {
struct my_struct *p;
unsigned long sum = 0;
list_for_each_entry_rcu(p, &my_list, list)
sum += p->value;
return sum;
}
----------

Assumptions are:

(1) v1, v2, v3 are statically allocated variables inside module,
while my_lock, my_list, add_entry(), del_entry(), reader()
are built-in.

(2) v1, v2, v3 are added to my_list only once upon module load

(3) v1, v2, v3 might be removed from my_list some time later after
module was loaded

Regards.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majo...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Paul E. McKenney

unread,

Nov 3, 2014, 9:00:06 PM11/3/14

to

On Sat, Nov 01, 2014 at 08:59:03PM +0900, Tetsuo Handa wrote:
> Excuse me for FAQ, Paul.

No problem -- this one took some thought.

> I want to confirm one thing for code optimization in LSM stacking.
> ( https://marc.info/?l=linux-security-module&m=141481716931982&w=2 )
>
> In the following code, is there race window for seeing invalid
> "struct list_head"->next value if we used list_for_each_entry()
> in place of list_for_each_entry_rcu() ?
>
> ----------
> /* Definition and declaration */
> DEFINE_SPINLOCK(my_lock);
> LIST_HEAD(my_list);
> struct my_struct {
> struct list_head list;
> const unsigned long value;
> } v1 = { .value = 1 }, v2 = { .value = 2 }, v3 = { .value = 3 };

OK, v1, v2, and v3 are assigned at compile time. Therefore, all CPUs
should be aware of their values -- if module loading were to fail
to make this guarantee, lots of things would break. However, their
initial ->list->next and ->list->prev values are NULL, courtesy of the
C initialization rules.

> /* Writer side */
> void add_entry(struct my_struct *p) {
> spin_lock(&my_lock);
> list_add_tail_rcu(&p->list, &my_list);

But we are adding to the tail of the list, so the initial NULL value
for ->list->next is correct anyway -- but only the first time they
are added to the list. If they were to be added to the list a second
time, and if something had been added after them during their first time
in the list, then the initial ->list->next value would no longer be
guaranteed to be seen as NULL by all other CPUs.

So if you do need to add them to the list a second time, one
way to do so safely is to use list_add_tail_rcu() when adding
and list_for_each_entry_rcu() -- enclosed in rcu_read_lock() and
rcu_read_unlock() -- when traversing. As always, you need to wait for
a grace period between the time you remove the element from the list
the first time and the time you add it to the list the second time.

Or you could do the following after deleting the element from the list:

o Wait for a grace period to elapse.

o NULL the ->list->next pointer.

o Wait for another grace period to elapse.

Then all CPUs would agree that the value of ->list->next was NULL, so
list_for_each_entry() could be used to traverse the list. But you would
still need rcu_read_lock() and rcu_read_unlock() around the traversal.

But if you only ever add it to the list that one time, then the
list_for_each_entry_rcu() could become list_for_each_entry(), and
rcu_read_lock() and rcu_read_unlock() are not needed. Again, this
assumes that the memory is never reused after being removed from the list.

Note that in this case module unload followed by module reload is tricky.
You have to "wait long enough" between unload and load. Or you need an
orderly teardown of some sort.

> spin_unlock(&my_lock);
> }
>
> void del_entry(struct my_struct *p) {
> spin_lock(&my_lock);
> list_del_rcu(&p->list);
> spin_unlock(&my_lock);
> }
>
> /* Reader side */
> unsigned long reader(void) {
> struct my_struct *p;
> unsigned long sum = 0;
> list_for_each_entry_rcu(p, &my_list, list)
> sum += p->value;
> return sum;
> }
> ----------
>
> Assumptions are:
>
> (1) v1, v2, v3 are statically allocated variables inside module,
> while my_lock, my_list, add_entry(), del_entry(), reader()
> are built-in.

When you say that my_lock and my_list are built-in, you mean that they
are defined in the base kernel rather than in the module? (I was assuming
that they were defined in the module.)

> (2) v1, v2, v3 are added to my_list only once upon module load
>
> (3) v1, v2, v3 might be removed from my_list some time later after
> module was loaded

Again, module unload followed by module load will be a bit dicey. Other
than that, it should work.

Thanx, Paul

Tetsuo Handa

unread,

Nov 4, 2014, 6:20:07 AM11/4/14

to

Paul E. McKenney wrote:
> But if you only ever add it to the list that one time, then the
> list_for_each_entry_rcu() could become list_for_each_entry(), and
> rcu_read_lock() and rcu_read_unlock() are not needed. Again, this
> assumes that the memory is never reused after being removed from the list.

That is what I wanted to confirm. v1, v2, v3 are added to the list only
once, and mymodule.ko containing v1, v2, v3 are not unload-able.

> Note that in this case module unload followed by module reload is tricky.
> You have to "wait long enough" between unload and load. Or you need an
> orderly teardown of some sort.

Yes, I'm aware of that.

> > Assumptions are:
> >
> > (1) v1, v2, v3 are statically allocated variables inside module,
> > while my_lock, my_list, add_entry(), del_entry(), reader()
> > are built-in.
>
> When you say that my_lock and my_list are built-in, you mean that they
> are defined in the base kernel rather than in the module? (I was assuming
> that they were defined in the module.)

I meant built-in as built into vmlinux. That is, my_lock, my_list,
add_entry(), del_entry(), reader() are defined in vmlinux , and v1, v2, v3
are defined in mymodule.ko .

>
> > (2) v1, v2, v3 are added to my_list only once upon module load
> >
> > (3) v1, v2, v3 might be removed from my_list some time later after
> > module was loaded
>
> Again, module unload followed by module load will be a bit dicey. Other
> than that, it should work.
>
> Thanx, Paul
>

If someone really needs to implement unload-able modules, he/she can
split such modules into non unload-able component and unload-able
component.

-------- non unload-able component --------
static DEFINE_SRCU(my_srcu_lock);
static int (*do_getvalue)(void);

static int mywrapper_getvalue(void)
{
const int idx = srcu_read_lock(&my_srcu_lock);
const typeof(do_getvalue) func = do_getvalue;
const int ret = func ? func() : 0;
srcu_read_unlock(&my_srcu_lock, idx);
return ret;
}

void mywrapper_register(typeof(do_getvalue) func)
{
do_getvalue = func;
}
EXPORT_SYMBOL_GPL(mywrapper_register);

void mywrapper_unregister(void)
{
do_getvalue = NULL;
synchronize_srcu(&my_srcu_lock);
}
EXPORT_SYMBOL_GPL(mywrapper_unregister);

struct my_struct {
struct list_head list;

const int (*func)(void);
} v1 = { .func = mywrapper_getvalue };

static int __init mywrapper_init(void)
{
add_entry(&v1);
return 0;
}
module_init(mywrapper_init);
-------- non unload-able component --------

-------- unload-able component --------
static int do_getvalue(void)
{
return (...snipped...);
}

static int __init mymodule_init(void)
{
mywrapper_register(do_getvalue);
return 0;
}
module_init(mymodule_init);

static void mymodule_exit(void)
{
mywrapper_unregister();
}
module_exit(mymodule_exit);
-------- unload-able component --------

Thank you.

Paul E. McKenney

unread,

Nov 5, 2014, 12:00:06 PM11/5/14

to

On Tue, Nov 04, 2014 at 08:18:03PM +0900, Tetsuo Handa wrote:
> Paul E. McKenney wrote:
> > But if you only ever add it to the list that one time, then the
> > list_for_each_entry_rcu() could become list_for_each_entry(), and
> > rcu_read_lock() and rcu_read_unlock() are not needed. Again, this
> > assumes that the memory is never reused after being removed from the list.
>
> That is what I wanted to confirm. v1, v2, v3 are added to the list only
> once, and mymodule.ko containing v1, v2, v3 are not unload-able.

Good enough, then. ;-)

> > Note that in this case module unload followed by module reload is tricky.
> > You have to "wait long enough" between unload and load. Or you need an
> > orderly teardown of some sort.
>
> Yes, I'm aware of that.
>
> > > Assumptions are:
> > >
> > > (1) v1, v2, v3 are statically allocated variables inside module,
> > > while my_lock, my_list, add_entry(), del_entry(), reader()
> > > are built-in.
> >
> > When you say that my_lock and my_list are built-in, you mean that they
> > are defined in the base kernel rather than in the module? (I was assuming
> > that they were defined in the module.)
>
> I meant built-in as built into vmlinux. That is, my_lock, my_list,
> add_entry(), del_entry(), reader() are defined in vmlinux , and v1, v2, v3
> are defined in mymodule.ko .

OK, even more straightforward, then.

> > > (2) v1, v2, v3 are added to my_list only once upon module load
> > >
> > > (3) v1, v2, v3 might be removed from my_list some time later after
> > > module was loaded
> >
> > Again, module unload followed by module load will be a bit dicey. Other
> > than that, it should work.
> >
> > Thanx, Paul
> >
>
> If someone really needs to implement unload-able modules, he/she can
> split such modules into non unload-able component and unload-able
> component.

Yep, that works also.

Thanx, Paul

Using list_for_each_entry() in place of list_for_each_entry_rcu() ?

Tetsuo Handa

Paul E. McKenney

Tetsuo Handa

Paul E. McKenney