
RE: Fast thread-local storage for OpenGL drivers



Jakub Jelinek wrote:
> 
> For the dispatch tables I even remember suggesting to:
> ...
> b) in addition to that, you can build an .a library with the above 5 lines
>    per .o file's source plus .hidden Foo which would make apps/libraries
>    using openGL even faster (as they wouldn't hop through PLT, which is
>    one memory load and indirect jump through the loaded value) at the
>    expense of making offset_Foo part of the openGL ABI (which as far as
>    I understood already is anyway because of the binary modules).
> c) or you could inline the calls

By default, both b) and c) are forbidden by the GNU/Linux OpenGL ABI.

> In the May thread, I'm pretty sure you mentioned __indirect* routines
> which are the biggest part of libGL.so are rarely used, which means they
> definitely should be compiled with -fpic; the rest, if it is really
> performance critical, can be put into awx sections using
> __attribute__((section("..."))).

Sorry, I'm not quite sure what you mean here...

-- 
Gareth Hughes (gareth nvidia com)
OpenGL Developer, NVIDIA Corporation




