[Linux-kernel-mentees] [PROJECT] clean up swapcache use of struct page

Shuah Khan skhan at linuxfoundation.org
Tue Aug 20 00:17:02 UTC 2019


Hi Matthew,

On 8/19/19 5:54 PM, Matthew Wilcox wrote:
> 
> This would be a good project for someone with a little experience and
> a lot of attention to detail.
> 
> The struct page is probably the most abused data structure in the kernel,
> and for good reason.  But some of the abuse is unnecessary ... a mere
> historical accident that would be better fixed.
> 
> Page cache pages use page->mapping and page->index to indicate which file
> the page belongs to and where in that file it is.  page->private may be
> used by the filesystem for its own purposes (eg buffer heads).
> 
> Anonymous pages use page->mapping to point to the anon VMA they belong
> to and page->index to record the offset within the VMA.  Then, if they
> are also part of the swap cache, they use page->private to record both
> the offset within the swap device and the index of the page within the
> swap device.
> 
> Then we get abominations like:
> 
> static inline pgoff_t page_index(struct page *page)
> {
>          if (unlikely(PageSwapCache(page)))
>                  return __page_file_index(page);
>          return page->index;
> }
> 
> My modest proposal for deleting the first two lines of that function is
> to first switch the uses of page->private and page->index for anonymous
> pages.  Then move the swp_type() back from page->index to page->private
> again [1].
> 
> I am willing to review patches and provide feedback.  I can go into more
> detail about how I think this should be tackled if there's interest.
> Also, if you know more than I do about the MM and think this is a bad
> idea, please do say ;-)
> 

I will be happy to add it to the project list for Spring session.

We can work together to come up with a task list to get the candidates
up to speed during the application process period which starts on Nov 1st.

> This is going to be a tough project because there are a lot of
> rarely-tested paths which directly reference (eg) page->index, and they
> might be talking about a page cache page or a swap page.  This is not
> a simple Coccinelle script.

Yes mm isn't an easy area especially for new developers. We can work
together to come up with a task list to get the candidates up to speed
during the application process period which starts on Nov 1st.

> 
> [1] We have enough bits to do this; on a 32-bit machine, we can at most
> have a VMA which covers 4GB memory and with a 4kB page size, that's only
> 20 bits needed to encode all possible offsets within a VMA).

That being said, I will wait for other mm experts weigh in.

thanks,
-- Shuah


More information about the Linux-kernel-mentees mailing list