Bug 886 - Reactive Flushing Strategy race condition
Summary: Reactive Flushing Strategy race condition
Status: RESOLVED FIXED
Alias: None
Product: TAO
Classification: Unclassified
Component: ORB (show other bugs)
Version: 1.1.14
Hardware: All All
: P3 blocker
Assignee: Carlos O'Ryan
URL:
Depends on:
Blocks: 132 296
  Show dependency tree
 
Reported: 2001-04-25 18:08 CDT by Carlos O'Ryan
Modified: 2001-08-08 12:44 CDT (History)
0 users

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Carlos O'Ryan 2001-04-25 18:08:00 CDT
The reactive flushing strategy flushes a transport using:

      TAO_ORB_Core *orb_core = transport->orb_core ();

      while (!transport->queue_is_empty ())
        {
          int result = orb_core->run (0, 1, ACE_TRY_ENV);
          ACE_TRY_CHECK;

          if (result == -1)
            return -1;
	}


unfortunately this loop has a race condition. The queue could be cleaned up by
another thread, before we can enter the event loop in orb_core->run(). 
Currently the problem only shows up when using AMI requests, for example,
$TAO_ROOT/tests/LongUpcalls.

A simple workaround is to use the Block_Flushing_Strategy.

The only solution that I can think of right now involves adding more complexity
to the Leader/Followers code, when the ORB flushes a transport it would need to
follow the same rules that "client" threads follow when they join the group.
But such a change is too complicated for me to take on at this point.
Comment 1 Carlos O'Ryan 2001-05-06 18:57:17 CDT
This one is mine.
Comment 2 Carlos O'Ryan 2001-05-09 12:55:07 CDT
Until this bug is fixed 132 should be open.
Comment 3 Carlos O'Ryan 2001-05-09 12:57:40 CDT
This bug would probably take 1 week, maybe 2. I think I can start work on it by
early next week (i.e. May, 14th)
Comment 4 Carlos O'Ryan 2001-05-17 12:07:30 CDT
I thought I had accepted this bug already.
Comment 5 Carlos O'Ryan 2001-08-08 12:44:30 CDT
The bug was fixed last week.  At least the one test that used to hang due to 
this problem does not anymore.
There are some other bugs that we are trying to track down, I suspect they will 
require some minor fixes, should they not we will re-enter the bugs as needed.