Please report new issues athttps://github.com/DOCGroup
Created attachment 957 [details] TAO patch We (OCI) encountered a race condition involving ORBInitializer_Registry and ORBInitializer's registered by services in shared libraries. The PI library gets automatically loaded by whichever service attempting the first ORBInitializer registration. Registering service A causes shared libs to be loaded in order A -> PI. The PI as part of its destruction clears out all registered ORBInitializers. However since at PI's destruction, library A has already been unloaded the process cores trying to access A's ORBInitializer dstr. We have seen this problem in client tests involving CSD, however we haven't been able to reproduce this in clean (outside client code) environment. The solution is to tie library A's unloading with its ORBInitializers lifetime. I will next upload the proposed fix for this issue. This fix has been pushed into OCI 1.5a and I have tested it locally on DOC head. If there are no objections I will push this patch in next week. Ciju
Created attachment 958 [details] ACE patch
wouldn't it be a better solution to have an API to remove an already registered ORBInitializer? When working on the sg/sc issues with Iliyan this was one of the things that came to my mind, it was on my list to report an issue to the OMG about this. Also couldn't this be resolved by the sg/sc changes from Iliyan, we fixed there some ordering problems when unloading. we had similar problems there
patch seems to have some debug statements that are just for testing
(In reply to comment #2) > wouldn't it be a better solution to have an API to remove an already registered > ORBInitializer? When working on the sg/sc issues with Iliyan this was one of > the things that came to my mind, it was on my list to report an issue to the > OMG about this. > > Also couldn't this be resolved by the sg/sc changes from Iliyan, we fixed there > some ordering problems when unloading. we had similar problems there >
(In reply to comment #2) > wouldn't it be a better solution to have an API to remove an already registered > ORBInitializer? But someone will still have to defer the dll unloading until ORBInitializer destruction. This is done by the PortableInterceptor::DLL_Resident_ORB_Initializer class. > When working on the sg/sc issues with Iliyan this was one of > the things that came to my mind, it was on my list to report an issue to the > OMG about this. > > Also couldn't this be resolved by the sg/sc changes from Iliyan, we fixed there > some ordering problems when unloading. we had similar problems there I am not sure about that. Iliyan do your changes target the scenario described earlier? thanks, Ciju
As part of Iliyan's changes we did definitively fix some dll unload order problems. I think we should hold this change until there is a reproducer, I am afraid that we are making changes for something that already got resolved on svn head. Having an api to register an orbinitializer is something I consider usefull, independent of this issue, what do you think?
(In reply to comment #6) > ... Having an api to register an orbinitializer is something I consider > usefull, independent of this issue, what do you think? I think you mean de-register, right? (sorry, if I'm missing the point) Yes, I think there must be an equivalent of init/fini mechanism so the initializers can be un-plugged explicitly when unloading their DLLs from memory. However, I haven't given this a lot of thought and can't say how it should be best approached.
(In reply to comment #7) > (In reply to comment #6) > > ... Having an api to register an orbinitializer is something I consider > > usefull, independent of this issue, what do you think? > > I think you mean de-register, right? (sorry, if I'm missing the point) > Yes, I think there must be an equivalent of init/fini mechanism so the > initializers can be un-plugged explicitly when unloading their DLLs from > memory. However, I haven't given this a lot of thought and can't say how it > should be best approached. > Yes, unregister, typed to fast. I think we should file an issue to the OMG and then maybe add already a TAO specific extension
(In reply to comment #6) > As part of Iliyan's changes we did definitively fix some dll unload order > problems. I think we should hold this change until there is a reproducer, I am > afraid that we are making changes for something that already got resolved on > svn head. Having an api to register an orbinitializer is something I consider > useful, independent of this issue, what do you think? I agree. Having a deregistration API would have been great and could be another way to solve this problem. I didn't go that route because it changes the initializer destruction order in which things are currently done and therefore more prone to bugs creeping in. It seemed an easier fix to keep the dll around till the time the registry finishes up with all registered initializers. Currently I don't have a test for this. CSD or SSLIOP tests may show up this problem. Ciju
(In reply to comment #9) > (In reply to comment #6) > > As part of Iliyan's changes we did definitively fix some dll unload order > > problems. I think we should hold this change until there is a reproducer, I am > > afraid that we are making changes for something that already got resolved on > > svn head. Having an api to register an orbinitializer is something I consider > > useful, independent of this issue, what do you think? > > I agree. Having a deregistration API would have been great and could be another > way to solve this problem. I didn't go that route because it changes the > initializer destruction order in which things are currently done and therefore > more prone to bugs creeping in. It seemed an easier fix to keep the dll around > till the time the registry finishes up with all registered initializers. It would be a much better fix to deregister things, that can also be used for cases where things are not in a dll. when the dll does it cleanup it can cleanup the installed initializers. > Currently I don't have a test for this. CSD or SSLIOP tests may show up this > problem. We really need a test to see if this still is a problem. we had issues on doc head that dll's got unloaded to early, Iliyan addressed this in the service config framework.