login
Login
/
Register
Search
Search this site:
Forums
News
Blogs
Features
Site
Home
»
Mailing list archives
»
linux-kernel
»
2010
»
May
»
3
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
view
thread
Previous message: [
thread
] [
date
] [
author
]
Next message: [
thread
] [
date
] [
author
]
[view in full thread]
From: Peter Zijlstra
Subject:
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
Date: Monday, May 3, 2010 - 9:55 am
On Mon, 2010-05-03 at 12:18 -0400, Rik van Riel wrote:
quoted text
> From: Rik van Riel <riel@redhat.com> > > Both the page migration code and the transparent hugepage patches expect > 100% reliable rmap lookups and use page_lock_anon_vma(page) to prevent > races with mmap, munmap, expand_stack, etc. > > Specifically, try_to_unmap indirectly calls vma_address, which uses the > difference between vma->vm_start and vma->vm_pgoff, which can race when a > stack is expanded downwards. VMA splitting and merging present similar > issues. > > With the new anon_vma code, one VMA can be attached to multiple anon_vmas, > however mmap, munmap, expand_stack and others only took one anon_vma->lock. > This patch changes things so we take the anon_vma locks for all of the > anon_vmas attached to a VMA in the code that try_to_unmap would otherwise > race against: mmap, munmap, expand_stack, etc. > > Unfortunately, this leads to a lock ordering conflict with the page_table_lock, > which protected the "same_vma" list in the anon_vma_chain. Replacing that > lock with a new lock (mm->anon_vma_chain_lock), which is taken higher up in > the mm locking hierarchy, solves that issue. This changes the locking rules > for the "same_vma" list to be either mm->mmap_sem for write, or mm->mmap_sem > for read plus the new mm->anon_vma_chain lock. This limits the place where > the new lock is taken to 2 locations - anon_vma_prepare and expand_downwards. > > Document the locking rules for the same_vma list in the anon_vma_chain and > remove the anon_vma_lock call from expand_upwards, which does not need it.
quoted text
> diff --git a/mm/mmap.c b/mm/mmap.c > index 456ec6f..81850fc 100644 > --- a/mm/mmap.c > +++ b/mm/mmap.c
quoted text
> @@ -1705,12 +1707,11 @@ int expand_upwards(struct vm_area_struct *vma, unsigned long address) > return -EFAULT; > > /* > - * We must make sure the anon_vma is allocated > - * so that the anon_vma locking is not a noop. > + * Unlike expand_downwards, we do not need to take the anon_vma lock, > + * because we leave vma->vm_start and vma->pgoff untouched. > + * This means rmap lookups of pages inside this VMA stay valid > + * throughout the stack expansion. > */ > - if (unlikely(anon_vma_prepare(vma))) > - return -ENOMEM; > - anon_vma_lock(vma); > > /* > * vma->vm_start/vm_end cannot change under us because the caller > @@ -1721,7 +1722,6 @@ int expand_upwards(struct vm_area_struct *vma, unsigned long address) > if (address < PAGE_ALIGN(address+4)) > address = PAGE_ALIGN(address+4); > else { > - anon_vma_unlock(vma); > return -ENOMEM; > } > error = 0; > @@ -1737,7 +1737,6 @@ int expand_upwards(struct vm_area_struct *vma, unsigned long address) > if (!error) > vma->vm_end = address; > } > - anon_vma_unlock(vma); > return error; > } > #endif /* CONFIG_STACK_GROWSUP || CONFIG_IA64 */
This does leave me worrying about concurrent faults poking at vma->vm_end without synchronization. --
unsubscribe notice
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to
majordomo@vger.kernel.org
More majordomo info at
http://vger.kernel.org/majordomo-info.html
Please read the FAQ at
http://www.tux.org/lkml/
Previous message: [
thread
] [
date
] [
author
]
Next message: [
thread
] [
date
] [
author
]
Messages in current thread:
[PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Rik van Riel
, (Mon May 3, 9:18 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Linus Torvalds
, (Mon May 3, 9:41 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Rik van Riel
, (Mon May 3, 9:53 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Peter Zijlstra
, (Mon May 3, 9:55 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Andrea Arcangeli
, (Mon May 3, 10:02 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Peter Zijlstra
, (Mon May 3, 10:11 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Linus Torvalds
, (Mon May 3, 10:17 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Andrea Arcangeli
, (Mon May 3, 10:18 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Rik van Riel
, (Mon May 3, 10:58 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Andrea Arcangeli
, (Mon May 3, 11:13 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Linus Torvalds
, (Mon May 3, 11:19 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Rik van Riel
, (Mon May 3, 11:38 am)
Re: [PATCH 1/2] mm: Take all anon_vma locks in anon_vma_lock
, Mel Gorman
, (Tue May 4, 6:12 am)
Navigation
Mailing list archives
Recent posts
Popular discussions
linux-kernel
:
Greg KH
Og dreams of kernels
Jens Axboe
[PATCH 31/33] Fusion: sg chaining support
Arnd Bergmann
Re: finding your own dead "CONFIG_" variables
Mark Brown
[PATCH 2/2] Subject: natsemi: Allow users to disable workaround for DspCfg reset
Tony Breeds
[LGUEST] Look in object dir for .config
git
:
Brian Downing
Re: Git in a Nutshell guide
John Benes
Re: master has some toys
Matthias Lederhofer
[PATCH 4/7] introduce GIT_WORK_TREE to specify the work tree
Alexander Sulfrian
[RFC/PATCH] RE: git calls SSH_ASKPASS even if DISPLAY is not set
Junio C Hamano
Re: Rss produced by git is not valid xml?
git-commits-head
:
Linux Kernel Mailing List
iSeries: fix section mismatch in iseries_veth
Linux Kernel Mailing List
ixbge: remove TX lock and redo TX accounting.
Linux Kernel Mailing List
ixgbe: fix several counter register errata
Linux Kernel Mailing List
b43: fix build with CONFIG_SSB_PCIHOST=n
Linux Kernel Mailing List
9p: block-based virtio client
linux-netdev
:
Michael Breuer
Re: [PATCH] af_packet: Don't use skb after dev_queue_xmit()
Michael Breuer
Re: [PATCH] af_packet: Don't use skb after dev_queue_xmit()
David Daney
[PATCH 5/7] Staging: Octeon Ethernet: Convert to NAPI.
Wolfgang Grandegger
[PATCH net-next v4 1/3] can: mscan: fix improper return if dlc < 8 in start_xmi...
Amit Kumar Salecha
[PATCHv3 NEXT 2/2] NET: Add Qlogic ethernet driver for CNA devices
openbsd-misc
:
Theo de Raadt
Re: Old IPSEC bug
Tomáš Bodžár
Problem with vpnc connection - check group password !
Insan Praja SW
Mandoc Compiling Error
Carl Roberso
Re: Cannot change MTU of carp in