Please report new issues athttps://github.com/DOCGroup
TAO 1.5.1 is what we compared to but the drop is there since change of sequences implementation to template based one. The suboptimal part is in demarshal_sequence(). sequence tmp(new_length); tmp.length(new_length); for(CORBA::ULong i = 0; i < new_length; ++i) { string_var string; if (!(strm >> string.inout ())) { return false; } else { tmp[i] = string._retn (); } } What happens that tmp is constructed with maximum of new_length. This uses allocbuf which creates a pointer buffer of that size and iterates over it filling it with 0s. The next call to length will then iterate over the buffer filling it with heap allocated empty strings. Immediately after this the loop above will heap allocate the real contents of the sequence and release all the previous empty strings by assigning the new values into the elements.
Performance will probably become even worse when bug#3473 is fixed.
added more deps, between 1.6.7 and 1.6.8 a bug has been fixed
I take this one. I take this one.(In reply to comment #2) > added more deps, between 1.6.7 and 1.6.8 a bug has been fixed > What do you mean? Which bug has been fixed? In which commit?
Xref: Scarab TAO#693.
added bugs that got already fixed as dependency
Created attachment 1078 [details] A proposed fix This fix also fixes 3473 and is closely related to 3550 and 3562. What is proposed is to add beside allocbuf a new function allocbuf_noinit which can be used with "T*data" constructor (from CORBA C++ mapping) when we create a temporary sequence. In this function a new buffer will not be filled in with default initialized values. Another thing that must improve performance is a new function copy_swap_range() implemented for element_traits. It swaps pointers for dynamically allocated types (i.e. for object_reference_traits, string_traits, and valuetype_traits) and copies values for all the rest (i.e. for array_traits and value_traits). This new function is used in generic_sequence::length(CORBA::ULong). And the last thing. It looks like ACE_OS::memset is better for performance then std::fill used in zero_range().
I don't have time to review the full patch, but make sure the end user never sees an old value or gets unitialized members, one of the changes between 1.5.1 and now is that we never return old data to the caller (security leak) when sequences are made smaller and bigger again. Always all values have to explicitly set to their initial value to also prevent from getting uninitialized values.
Recheck this change, is this needed? - if (length <= maximum_ || length <= length_) + if (length <= maximum_)
(In reply to comment #8) > Recheck this change, is this needed? > - if (length <= maximum_ || length <= length_) > + if (length <= maximum_) > Indeed, I forgot to mention that in the patch description. As far as I understand maximum_ is _always_ greater than length_ and thus the second check will never evaluate to true when the first one is false. Is it possible that length_ is greater that maximum_?
(In reply to comment #9) > (In reply to comment #8) > > Recheck this change, is this needed? > > - if (length <= maximum_ || length <= length_) > > + if (length <= maximum_) > > > > Indeed, I forgot to mention that in the patch description. As far as I > understand maximum_ is _always_ greater than length_ and thus the second check > will never evaluate to true when the first one is false. Is it possible that > length_ is greater that maximum_? maybe unbounded sequences? Enable the sequence unit tests (they do depend on boost, that dependency should be removed) and try to run these.
(In reply to comment #10) > (In reply to comment #9) > > (In reply to comment #8) > > > Recheck this change, is this needed? > > > - if (length <= maximum_ || length <= length_) > > > + if (length <= maximum_) > > > > > > > Indeed, I forgot to mention that in the patch description. As far as I > > understand maximum_ is _always_ greater than length_ and thus the second check > > will never evaluate to true when the first one is false. Is it possible that > > length_ is greater that maximum_? > > maybe unbounded sequences? Enable the sequence unit tests (they do depend on > boost, that dependency should be removed) and try to run these. > Those tests don't work according to bug#3488. The only build where it runs actually is http://tao.prismtech.com/doc-scoreboard/rhel4/gcc344/standard/index.html
> Those tests don't work according to bug#3488. The only build where it runs > actually is > http://tao.prismtech.com/doc-scoreboard/rhel4/gcc344/standard/index.html Then the tests have to be updated and fixed accordingly the behaviour we expect. Reducing the dependency on boost (see 3385) is the thing that has to be done asap, the sequence unit tests are important (given the number of open bugs on this).
(In reply to comment #12) > > Those tests don't work according to bug#3488. The only build where it runs > > actually is > > http://tao.prismtech.com/doc-scoreboard/rhel4/gcc344/standard/index.html > > Then the tests have to be updated and fixed accordingly the behaviour we > expect. Reducing the dependency on boost (see 3385) is the thing that has to be > done asap, the sequence unit tests are important (given the number of open bugs > on this). > Sorry, I don't have time currently for this. The test is very big and written so that it's not clear what it tests and how it must be done in a right (TAO) way.
> Sorry, I don't have time currently for this. The test is very big and written > so that it's not clear what it tests and how it must be done in a right (TAO) > way. Then make sure things are reviewed a lot of times, making changes to the sequences is always tricky and seems to cause regressions each time
(In reply to comment #14) > > Sorry, I don't have time currently for this. The test is very big and written > > so that it's not clear what it tests and how it must be done in a right (TAO) > > way. > > Then make sure things are reviewed a lot of times, making changes to the > sequences is always tricky and seems to cause regressions each time > The patch is actually tested already in our local repo on major platforms and in valgrind build. There seems to be no problems.
> The patch is actually tested already in our local repo on major platforms and > in valgrind build. There seems to be no problems. Your version is different then svn head, a test on a different tree doesn't give guarantee. The valgrind build has so many leaks that it is hard to tell whether there is a new leak, the real test are the sequence unit tests, but it is bad that they are not working. At least check in your build that things that worked are still working
Fixed in 84506. Wed Feb 18 10:37:15 UTC 2009 Vladimir Zykov <vz@prismtech.com> Fixed bug#3574 and bug#3473. This must improve performance when handling sequences. * tao/String_Traits_T.h: * tao/Value_Traits_T.h: * tao/Object_Reference_Traits_T.h: * tao/Array_Traits_T.h: * tao/Valuetype/Valuetype_Traits_T.h: Added copy_swap_range() which swaps pointers for dynamically allocated types and copies values for all of the rest. This is useful when we know that copying is reduntant since the old buffer will be destroyed immediately after the operation. Changed std::fill to ACE_OS::memset in zero_range() which turned to be faster when only needed to zero a region of memory. * tao/Bounded_Value_Allocation_Traits_T.h: * tao/Bounded_Reference_Allocation_Traits_T.h: * tao/Bounded_Array_Allocation_Traits_T.h: * tao/Unbounded_Value_Allocation_Traits_T.h: * tao/Unbounded_Reference_Allocation_Traits_T.h: * tao/Unbounded_Array_Allocation_Traits_T.h: * tao/Valuetype/Bounded_Valuetype_Allocation_Traits_T.h: * tao/Valuetype/Unbounded_Valuetype_Allocation_Traits_T.h: Added allocbuf_noinit which does what allocbuf previously did and now allocbuf works as required by CORBA C++ mapping i.e. it returns correctly initilized buffer. This also fixes bug#3473. * tao/Valuetype/Bounded_Valuetype_Sequence_T.h: * tao/Valuetype/Unbounded_Valuetype_Sequence_T.h: * tao/Unbounded_Sequence_CDR_T.h: * tao/Bounded_Sequence_CDR_T.h: Changed the way temporaries are created. Since they don't need a properly constructed internal buffer allocbuf_noinit is used for their creation. * tao/Unbounded_Octet_Sequence_T.h: * tao/Generic_Sequence_T.h: Updated the temporaries creation and improved generic_sequence:: length(CORBA::ULong) when extending a sequence by employing copy_swap_range() for old values while moving them to a new sequence.
In revision 84524. Thu Feb 19 10:59:31 UTC 2009 Vladimir Zykov <vz@prismtech.com> * tao/Generic_Sequence_T.h: Fixed a small mistake in the commit "Wed Feb 18 10:37:15 UTC 2009 Vladimir Zykov".
reopening, I have doubts about this change: Changed std::fill to ACE_OS::memset in zero_range() which turned to be faster when only needed to zero a region of memory. What is we have a complex data value, then we have objects in memory, not plain data Also this revert should be retested: length <= maximum_ What is length is set bigger then the maximum
after the changes of yesterday we still have 2 more failing tests that are based on boost.
(In reply to comment #19) > reopening, I have doubts about this change: > > Changed std::fill to ACE_OS::memset in zero_range() which > turned to be faster when only needed to zero a region of > memory. > > What is we have a complex data value, then we have objects in memory, not plain > data 84540 Fri Feb 20 11:36:50 UTC 2009 Vladimir Zykov <vz@prismtech.com> * tao/Value_Traits_T.h: Reverted the change from std::fill to ACE_OS::memset. Despite there was no problem with it on the scoreboard it was unsafe in case value_traits was instantiated for struct that has a non-POD type as its member. The rest of similar changes are equivalent to what we had before. > > Also this revert should be retested: > length <= maximum_ > > What is length is set bigger then the maximum > In general TAO allows to do the following: generic_sequence tmp (10, 20, generic_sequence::allocbuf (20), true); However, CORBA C++ mapping says that: 4.15 Mapping for Sequence Types .... "For a bounded sequence, attempting to set the current length to a value larger than the maximum length given in the OMG IDL specification produces undefined behavior." and later "For an unbounded sequence, the maximum() accessor function returns the total number of sequence elements that can be stored in the current sequence buffer. This allows applications to know how many items they can insert into an unbounded sequence without causing a reallocation to occur." So, for the conforming application even if really length_ > maximum_ it shouldn't assume that it can safely read/write sequence elements above maximum_. Moreover, if we had that additional check in length() that I removed then the following code could crash. generic_sequence tmp (10, 20, generic_sequence::allocbuf (10), true); tmp.length (15);
(In reply to comment #20) > after the changes of yesterday we still have 2 more failing tests that are > based on boost. > This test is broken and needs to be rewritten. Should it be an evidence that the fix is wrong?
(In reply to comment #22) > (In reply to comment #20) > > after the changes of yesterday we still have 2 more failing tests that are > > based on boost. > > > > This test is broken and needs to be rewritten. Should it be an evidence that > the fix is wrong? When a change is made that breaks a test it could mean two things, the test is wrong of the fix is wrong. If the test is wrong, then fix the test, else fix the change. It isn't acceptable to just accept more failing tests because of a change.
> In general TAO allows to do the following: > > generic_sequence tmp (10, 20, generic_sequence::allocbuf (20), true); > > However, CORBA C++ mapping says that: > > 4.15 Mapping for Sequence Types > .... > "For a bounded sequence, attempting to set the current length to a value larger > than the maximum length given in the OMG IDL specification produces undefined > behavior." > > and later > > "For an unbounded sequence, the maximum() accessor function returns the total > number of sequence elements that can be stored in the current sequence buffer. > This allows applications to know how many items they can insert into an > unbounded sequence without causing a reallocation to occur." > > So, for the conforming application even if really length_ > maximum_ it > shouldn't assume that it can safely read/write sequence elements above > maximum_. > > Moreover, if we had that additional check in length() that I removed then the > following code could crash. > > generic_sequence tmp (10, 20, generic_sequence::allocbuf (10), true); > tmp.length (15); Apps don't use generic_sequence directly. Carlos added a feature to detect a too big length and get an exception using range::check. I now see that the code we talk about is really a different case, we shrink the sequence, do we then release (or set to default value). It seems in the new code you need to allocate a new sequence and copy everything, in the old code we just cleared the members that wheren't accessable anymore. Seems that we have a performance drop because this. With the sequence unit tests (the boost one) you should be able to test this explicitly. If you shrink a sequence of 100 elements to 99 elements, you should just see the release of 1 member, no allocation at all. When you then grow to 100 there shouldn't be any allocation. Please make an explicit test for this case.
reopen because we need an explicit test for the following: - sequence of 100 elements, means buffer allocation of 100 elements - shrink to 99, means reinitialize 1 element to zap the old value - extend to 100, means no allocation
(In reply to comment #25) > reopen because we need an explicit test for the following: > - sequence of 100 elements, means buffer allocation of 100 elements > - shrink to 99, means reinitialize 1 element to zap the old value > - extend to 100, means no allocation > Any update on this?
In 84598. Wed Feb 25 10:07:20 UTC 2009 Vladimir Zykov <vz@prismtech.com> * tests/Bug_3574_Regression/test.cpp: * tests/Bug_3574_Regression/run_test.pl: * tests/Bug_3574_Regression/README: * tests/Bug_3574_Regression/Bug_3574_Regression.mpc: * bin/tao_orb_tests.lst: Added a test for bug#3574 and scheduled it for run.