Summary: | Invocation code should be refactored | ||
---|---|---|---|
Product: | TAO | Reporter: | Jeff Parsons <parsons> |
Component: | ORB | Assignee: | Ossama Othman <ossama.othman> |
Status: | RESOLVED FIXED | ||
Severity: | enhancement | ||
Priority: | P3 | ||
Version: | 1.2.5 | ||
Hardware: | All | ||
OS: | All | ||
Bug Depends on: | |||
Bug Blocks: | 1390, 1592, 1986, 285, 286, 451, 586, 845, 1089, 1229, 1496, 1637, 1988, 2172 |
Description
Jeff Parsons
2002-11-21 11:43:49 CST
Accepted for tao-support. I think we can use those techniques to further reduce the size of the generated code *and* solve a host of problems for collocated calls. Remember that the arguments (and return value) for an operation are passed to the Invocation class using an array of Argument* objects. Well, suppose the call was actually collocated. Instead of doing the crazy stuff we do today, we could simply generate two skeleton functions, the regular skeleton and a "skeleton for collocated calls", which would look like this: /* static */ void POA_MyServant::_tao_collocated_skel_my_operation( size_t argument_count, TAO::Argument *arguments[], ....) { this skeleton function could extract the arguments using dynamic casts (or their emulated version if we lack RTTI support): // First check the argument count against the real number of // arguments if(argument_count != 5) throw CORBA::MARSHAL(); // Then downcast each argument and raise if downcast is unsucessful Real_Argument_Type * _tao_first_argument = dynamic_cast<Real_Argument_Type*>(arguments[0]); if(_tao_first_argument == 0) throw CORBA::MARSHAL(); Once all the arguments have been downcasted we can get the Servant (POA mediated or not) and call the implementation method directly. There is probably some trickery in getting this stuff to work while keeping the POA decoupled. Basically we will need to know if the object has *any* chance of being collocated (we can do this when the object is created using the same techniques we use today.) If so, we need to call the object adapter to handle the invocation object. If collocation does not apply (for whatever reason) we follow the normal path. In short, a few more methods in that object adapter abstract class. What are the advantages of doing this vs. the current approach? 1) We would not need to generate special proxy classes for the collocated case. Just the special skeleton. This may prove to provide some footprint savings... or not. Basically is a tradeoff, so hopefully the difference is small. OTOH: 2) It also eliminates the need for the proxy brokers, thus we save in compilation time and code footprint. Basically we would need to implement the broker code once, at the library level. 3) Because the arguments are wrapped in the little Argument classes the client-side and server-side interceptors for collocated calls can be moved to the library too (more features for the same footprint / compilation-time.) 4) Because the code is in the library it is easier to control (for example via some Policy) when is collocation used, and what type applies. When we remove the sequence base class generation, we should also remember to make sure that replace() and get_buffer() are in all the sequence template classes. Some of them are missing. This isn't about invocation code refactoring, but it does apply to footprint reduction, so I wanted to get it down before I forget it. We can make the IDL compiler smart about what files from TAO it includes in generated code. One of the things this would require is to keep an inverse of the member lists - that is, declarations would need to know not only what other declarations they reference, but also what other declarations reference them, and what types these other declarations are. Of course, if no declarations of a certain type are present, that may also be important. Information of this kind can also affect directly what does or does not get generated. For example, if a given interface is not used in a sequence, there is no need to generate the forward declaration workarounds for the object sequence methods _downcast and _upcast. In a branch, I've already separated the code generation for these particular workarounds from the other four: _duplicate, CORBA::release, _nil, and marshal (a new one to prevent the marshaling of local interfaces), so it will be easy to turn this piece of code generation off, if it turns out to be possible to achieve this optimization. Adding bug 133 as a dependency, because if the collocation decision is implememented in the library instead of the "when the object reference is created", then we can easily look at the target object and decide if we need to follow the collocated path or not. Strictly speaking bug 133 can be resolved without the fixes for bug 1369, the proxy has to be smarter: - For each class change the ProxyBroker to create collocated objects the first time a skeleton is instantiated - The collocated proxy looks at the effective target, if it is collocated (this can be cached at IOR creation time) the collocated path is used, otherwise fall back on the remote path. Jeff and Bala already took care of the client side. I'm currently work on the server side. Mine. Update on skeleton refactoring progress: * Enabled/fixed server side "SArg_Traits" supports in TAO_IDL. * Made accompanying fixes to PortableServer files (Upcall_Wrapper.*, ServerRequestInfo.*, etc) * Removed all vestiges of the interceptor related files in TAO_IDL. * Includes zapping all interceptor related visitors, and consequently all interceptor related generated code in the skeleton. * Sweet! * With the above completed, I'm now able to compile the skeleton in Jeff's Param_Test torture test with my changes in place. (All changes/fixes have been committed to my skeleton-refactor branch.) Since refactored skeletons can now be compiled, I was able to get some more meaningful numbers: Skeleton: $TAO_ROOT/tests/Param_Test/param_testS.* Configuration: * g++ 3.4.2 on Debian GNU/Linux "unstable" distribution * Exceptions and inlining enabled * Optimization disabled * Shared library / PIC binaries "old" == "HEAD" branch as of 14 October 2004. "new" == "skeleton-refactor" branch as of today -- split off from HEAD on 14 September 2004. Source Metrics (not preprocessed) ================================= Bytes (old / new / % change): 446820 / 215177 / 52% Lines (old / new / % change): 16402 / 7728 / 53% Object File Metrics =================== Size (using `size' command) ......... HEAD branch .......... $ size .obj/param_testS.o text data bss dec hex filename 362078 4 232 362314 5874a .obj/param_testS.o ......... skeleton-refactor branch .......... $ size .obj/param_testS.o text data bss dec hex filename 196087 8 256 196351 2feff .obj/param_testS.o % change = 46% Updated List of TODO Items ========================== 1. Refactor ThruPOA collocation code in skeleton (direct collocation later) 2. Get linking to work 3. Measure footprint of linked applications 4. Perform run-time regression tests 5. Run performance benchmarks 6. Remember what I forgot to add to this list. :-) Done. See the following ChangeLog entry: Tue Feb 22 02:03:20 2005 Ossama Othman <ossama@dre.vanderbilt.edu> |