Re: (ITS#4938) hdb_db_close SEGVs

----- quanah@zimbra.com wrote:
> ----- richton@nbcs.rutgers.edu wrote:
> > I don't have #5 (sleepycat#14657) nor the unofficial
> >
> http://www.stanford.edu/services/directory/openldap/configuration/patches/db/4252-region-fix.diff
> >
> > patch. As for the official one, I'm not sure about its relevance to
> > the
> > actual SEGV due to the "recovery...fail" comment. In other words,
> > though
> > it may be impacting the ability of alock/db_recover to do its thing,
>
> > that's just a side effect of the unclean shutdown which is the real
> > bug
> > here to my view.
>
>
>
>
> Patch #5 specifically deals with a race condition where a checkpoint
> is occuring while a cache buffer retrieval is also occuring causing a
> database corruption that will later not be recoverable from. At
> least, that's how I read sleepcat's description:
>
> 5. Fix a bug where cache buffer retrieval could race with a checkpoint
> call, potentially causing database environment recovery to fail.
> [#14657]
>
> Given that OpenLDAP checkpoints on shutdown, shutting down the server
> could be what is triggering the issue for you. I'd suggest applying
> the patch and seeing if this resolves your problem.
Just to note, I shut down one of my 2.3.35 servers that's served out over 1 million connections since I brought it up, and everything was clean on both shutdown and startup.
--Quanah