[strongSwan] Occasional kernel crash at __xfrm_state_lookup

Jiri Horky jiri.horky at gmail.com
Fri Sep 19 14:21:02 CEST 2014


Hi Martin,

thanks for the response.

On 09/19/2014 09:53 AM, Martin Willi wrote:
> Hi Jiri,
>
>> We experience relatively frequent kernel crashes (~2 a day across 102
>> nodes). I enclose a stack trace:
>> [17243.492885]  [<ffffffff81601c06>] xfrm_state_lookup+0x66/0x90
>> [17243.492907]  [<ffffffff8160796e>] xfrm_user_state_lookup+0x6e/0xe0
>> [17243.492930]  [<ffffffff81255078>] ? security_capable+0x18/0x20
>> [17243.492952]  [<ffffffff81608912>] xfrm_get_sa+0x42/0xc0
>> [17243.492971]  [<ffffffff816078db>] xfrm_user_rcv_msg+0x12b/0x150
> I haven't seen such crashes on any somewhat recent kernel.
>
>> The failing offset is 0x3168, which is on line 676:
>>    675          hlist_for_each_entry(x, net->xfrm.state_byspi+h, byspi) {
>>    676                  if (x->props.family != family ||
> To me this looks like a list corruption or something, hard to say what
> causes this. Probably a bug lingering somewhere in the kernel.
That's what I think as well. I suspect it is some kind of race condition
related to the number of tunnels we have.
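
For reference, here is a minimal user-space sketch of the kind of bucket
walk shown in the quoted lines 675-676. It is not the kernel code:
sa_entry, sa_table, sa_hash, sa_insert and sa_lookup are hypothetical
names, the hash is a placeholder, and there is no locking at all. It only
illustrates that the loop reads a field of every node before doing
anything else, so a stale or corrupted chain pointer would plausibly fault
at the first comparison, which is where the reported offset points.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Simplified stand-in for struct xfrm_state: only the compared fields. */
struct sa_entry {
    struct sa_entry *next;   /* bucket chain link (the kernel uses an hlist_node) */
    uint16_t family;         /* models x->props.family */
    uint32_t spi;            /* models x->id.spi */
};

#define SA_HASH_SIZE 8u      /* hypothetical table size, must be a power of two */

struct sa_table {
    struct sa_entry *bucket[SA_HASH_SIZE];
};

/* Placeholder hash; the kernel's real SPI hash is different. */
unsigned int sa_hash(uint32_t spi)
{
    return spi & (SA_HASH_SIZE - 1u);
}

/* Push an entry onto the head of its bucket chain. */
void sa_insert(struct sa_table *t, struct sa_entry *e)
{
    assert(t != NULL && e != NULL);
    unsigned int h = sa_hash(e->spi);
    e->next = t->bucket[h];
    t->bucket[h] = e;
}

/*
 * Walk one bucket and return the first entry matching family and SPI.
 * Every iteration dereferences x (x->family) before any other check,
 * so the walk is only as safe as the chain pointers it follows.
 */
struct sa_entry *sa_lookup(const struct sa_table *t, uint32_t spi,
                           uint16_t family)
{
    for (struct sa_entry *x = t->bucket[sa_hash(spi)]; x != NULL; x = x->next) {
        if (x->family != family || x->spi != spi)
            continue;
        return x;
    }
    return NULL;
}
```

In this sketch nothing protects the chain against a concurrent insert or
removal; whether a missing or insufficient protection is the actual cause
of the crash is only my guess at this point.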
>
> I don't know what non-vanilla changes your kernel has. Probably you
> should check that issue with your kernel distributor.
Actually, it is a pure vanilla v3.14.13 kernel.

I will try to post the problem to a kernel.org mailing list and get back
to you if it gets resolved.

Jirka H.

