Bug 447 - Ideas to reduce the number of data copies during marshaling
Summary: Ideas to reduce the number of data copies during marshaling
Status: NEW
Alias: None
Product: TAO
Classification: Unclassified
Component: ORB (show other bugs)
Version: 1.0.13
Hardware: All All
: P2 major
Assignee: DOC Center Support List (internal)
URL:
Depends on:
Blocks:
 
Reported: 2000-02-19 20:14 CST by Carlos O'Ryan
Modified: 2007-09-20 03:08 CDT (History)
0 users

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Carlos O'Ryan 2000-02-19 20:14:28 CST
Currently the ORB performs data copies that could be eliminated. For example,
when sending a large sequence of longs the data could just be added to the
chain of message blocks.  This optimization is currently in place for octets,
but not for other basic data types.
On the demarshaling path some similar optimizations could be applied, but here
the situation is more complicated: the memory allocated for the CDR stream can
be allocated on the stack or TSS storage.  Whatever optimizations we use must
be limited to the case where the memory is only used in the current thread
activation, and does not survive the upcall to the application.
With octet sequences we use reference counting on the CDR message block, and
provide transparent extensions to the octet sequence implementation that
handle that reference count.  Unfortunately this technique will not scale,
and suffers from a number of problems, for example, when used in conjunction
with TSS allocators: the application could try to release the octet sequence in
another thread, leading to corruption in the TSS allocator.
A more promising approach is to:
- Inform the CDR stream if the memory it contains can be used in other threads,
  that info can be used to disable "dangerous" optimizations automatically.
  This is also useful in the buffered oneways case, to minimize data copying
  when the message is buffered.
- When demarshaling sequences it is possible to create the sequence without
  ownership of its data, just pointing to the CDR stream.  But this is only
  applicable if:
  + The CDR stream is alive during the upcall to the application.
  + The sequence is only used as an 'in' parameter.
  I'm not sure how to detect this case, but it could be an important way to
  improve the demarshaling time.
- When demarshaling Anys we perform several data copies, but finally the data
  is stored as a CDR stream to be demarshaled in the >>= operator.  Those
  copies could be optimized, but all the problems with TSS or stack allocated
  memory crop up again.
Comment 1 Carlos O'Ryan 2000-02-21 18:38:59 CST
*** Bug 356 has been marked as a duplicate of this bug. ***
Comment 2 Carlos O'Ryan 2000-02-26 22:17:59 CST
These entries are for documentation purposes, i have no time to work on them.
Comment 3 levine 2000-03-05 09:23:59 CST
Please assign such that we don't get the daily
reminder from bugzilla.
Comment 4 Carlos O'Ryan 2000-03-05 15:17:59 CST
They belong to tao-support until someone volunteers to take them.
Comment 5 Carlos O'Ryan 2000-03-05 15:18:59 CST
Left on tao-support until someone takes it over.
Comment 6 Carlos O'Ryan 2001-02-08 01:45:05 CST
Pradeep is working on this one.  It is a crucial optimization for the 1.2 
release, therefore it is now marked as a blocker.
Comment 7 Pradeep Gore 2001-02-16 12:36:18 CST
look forward to the optimization binge.. :)
Comment 8 Nanbor Wang 2001-06-21 07:21:18 CDT
Pradeep did some work on this. I dont think we have enough resources (time) to 
do this before 1.2. We did perform certain optimizations though.  I am just 
moving this out of the blocker list.
Comment 9 pradeep 2002-11-28 12:25:22 CST
I'am no longer working on this.
Comment 10 Nanbor Wang 2002-12-01 07:33:40 CST
Assigning it to me since I may get a chance to look into this
Comment 11 Nanbor Wang 2002-12-10 12:23:05 CST
Accepting the bug
Comment 12 Johnny Willemsen 2007-09-20 03:08:58 CDT
to pool