external_mesa3d - external/mesa3d

	Commit message	Author	Age	Files	Lines
*	nir: Change nir_shader_get_entrypoint to return an impl.	Kenneth Graunke	2016-08-25	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Jason suggested adding an assert(function->impl) here. All callers of this function actually want ->impl, so I decided just to change the API. We also change the nir_lower_io_to_temporaries API here. All but one caller passed nir_shader_get_entrypoint(), and with the previous commit, it now uses a nir_function_impl internally. Folding this change in avoids the need to change it and change it back. v2: Fix one call I missed in ir3_compiler (caught by Eric). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>
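The API change described above can be sketched with simplified stand-in types. This is only an illustration under stated assumptions: the `example_*` names are hypothetical, the lookup by the name "main" is assumed, and none of this is the actual NIR code.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical, simplified stand-ins for the NIR types named in the commit. */
struct example_function_impl {
    int num_blocks;
};

struct example_function {
    const char *name;
    struct example_function_impl *impl; /* NULL for a bare declaration */
};

/* Return the entrypoint's implementation directly, so callers no longer
 * dereference ->impl themselves. Returns NULL if no "main" is found. */
static struct example_function_impl *
example_get_entrypoint(struct example_function *funcs, size_t count)
{
    for (size_t i = 0; i < count; i++) {
        if (strcmp(funcs[i].name, "main") == 0) {
            /* The check suggested in review: an entrypoint must have a body. */
            assert(funcs[i].impl != NULL);
            return funcs[i].impl;
        }
    }
    return NULL;
}
```

Returning the impl keeps the assertion in one place instead of repeating it at every call site.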
*	gallium: add a cap to expose whether driver supports mixed color/zs bits	Ilia Mirkin	2016-08-23	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Some hardware can't render to color/depth buffers of mixed bitness. When that happens a fallback has to happen, but this allows the driver to express that this isn't an optimal scenario. The purpose of this is to remove such fbconfigs from the GLX/EGL config list. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>
*	a4xx: make sure to actually clamp depth as requested	Ilia Mirkin	2016-08-19	2	-2/+29
\| \| \| \| \| \| \| \| \| \| \|	We were previously ... not clamping. I guess this meant that everything got clamped to 1/0, which was enough to pass the existing tests. Or perhaps the clamping would only happen to the rasterized depth value and not the frag shader's output depth value. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97231 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org
*	a4xx: only disable depth clipping, not all clipping, when requested	Ilia Mirkin	2016-08-19	2	-1/+4
\| \| \| \| \| \| \| \| \| \|	The previous bit disables the whole clipper, including the regular viewport-related clipping that would go on. The two new bits disable near and far clipping (separately, as verified with the depth-clamp-range piglit). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org
*	ttn: Use nir_load_front_face instead of the TGSI-style input.	Eric Anholt	2016-08-19	1	-46/+0
\| \| \| \| \| \| \|	This reduces the diff between GLSL-to-NIR and TGSI-to-NIR, and gives NIR more optimization to work on. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
*	ttn: Make FRAG_RESULT_DEPTH be a float variable to match gtn and ptn.	Eric Anholt	2016-08-19	2	-7/+0
\| \| \| \| \| \| \|	This lets TTN-using drivers handle FRAG_RESULT_DEPTH the same between all their source paths. Reviewed-by: Rob Clark <robdclark@gmail.com>
*	gallium: change pipe_sampler_view::first_element/last_element -> offset/size	Marek Olšák	2016-08-17	3	-8/+5
\| \| \| \| \| \| \| \| \| \| \|	This is required by OpenGL. Our hardware supports this. Example: Bind RGBA32F with offset = 4 bytes. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97305 Acked-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
*	freedreno/a3xx: fix generic clear path	Rob Clark	2016-08-16	1	-0/+1
\| \| \| \|	Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno/a4xx: use generic clear path	Rob Clark	2016-08-16	2	-215/+4
\| \| \| \|	Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno/a3xx: use generic clear path	Rob Clark	2016-08-16	2	-200/+4
\| \| \| \|	Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: support for using generic clear path	Rob Clark	2016-08-16	5	-10/+92
\| \| \| \| \| \| \| \|	Since clears are more or less just normal draws, there isn't that much benefit in having a hand-rolled clear path. Add support to use u_blitter instead if the gen-specific backend doesn't implement ctx->clear(). Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno/a3xx+a4xx: move common VBOs to fd_context	Rob Clark	2016-08-13	10	-185/+116
\| \| \| \| \| \| \| \|	These are the same for a3xx and later. (a2xx could probably use them too, but due to limited hw support and ancient downstream kernels, it isn't so easy to test.) Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno/a2xx: add missing casts to silence notices	francians@gmail.com	2016-08-13	1	-2/+2
\| \| \| \| \|	Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno/ir3: fix issue with emit_tex()	Rob Clark	2016-08-13	1	-19/+28
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	For various tex fetch instructions, coords get fixed up in different ways. But modifying the array returned from get_src() has side effects if the same SSA src is used again: the later instruction will see the previous fixups. Fix this, and const'ify things to prevent this sort of mistake in the future. Noticed by Varad when adding support for txf_ms. Signed-off-by: Rob Clark <robdclark@gmail.com>
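The aliasing problem described above can be sketched in isolation. This is a minimal illustration, not the ir3 code: `cached_coords`, `get_coords` and `fixed_up_x` are hypothetical names, and the "fixup" is just an addition.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a lookup that returns a shared, cached array. */
static int cached_coords[3] = { 10, 20, 30 };

/* Returning a pointer to const makes in-place modification a compile error. */
static const int *get_coords(void)
{
    return cached_coords;
}

/* Copy the shared values, then apply the per-instruction fixup to the copy,
 * so a later user of the same source still sees the original values. */
static int fixed_up_x(int bias)
{
    const int *src = get_coords();
    int local[3];

    for (size_t i = 0; i < 3; i++)
        local[i] = src[i];

    local[0] += bias;
    return local[0];
}
```

Calling `fixed_up_x()` twice with the same bias gives the same result, because the shared array is never written.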
*	gallium: add a pipe_context parameter to fence_finish	Marek Olšák	2016-08-10	2	-0/+2
\| \| \| \| \| \| \| \|	Required by glClientWaitSync (GL 4.5 Core spec), which can optionally flush the context. Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
*	gallium: add render_condition_enable param to clear_render_target/depth_stencil	Marek Olšák	2016-08-10	1	-2/+4
\| \| \| \| \|	Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
*	freedreno/a4xx: fix comparison out of range warnings	francians@gmail.com	2016-07-30	1	-7/+7
\| \| \| \| \|	Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno/a3xx: fix comparison out of range warnings	francians@gmail.com	2016-07-30	1	-7/+7
\| \| \| \| \|	Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno/a2xx: fix comparison out of range warnings	francians@gmail.com	2016-07-30	1	-4/+4
\| \| \| \| \|	Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno/ir3: init ir3_shader_key with memset()	francians@gmail.com	2016-07-30	1	-1/+2
\| \| \| \| \| \| \|	To silence missing initializers warning Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>
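A minimal sketch of the memset() pattern, assuming the warning came from a partially specified brace initializer. The struct and its fields are hypothetical stand-ins, not the real ir3_shader_key layout.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

/* Hypothetical stand-in for a shader key with several fields. */
struct example_shader_key {
    bool half_precision;
    bool binning_pass;
    unsigned color_two_side;
};

/* Zero the whole object first, then set only the fields of interest.
 * No initializer list is involved, so no field can be reported missing. */
static struct example_shader_key make_key(bool binning_pass)
{
    struct example_shader_key key;

    memset(&key, 0, sizeof(key));
    key.binning_pass = binning_pass;
    return key;
}
```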
*	gallium/freedreno: move cast to avoid integer overflow	Eric Engestrom	2016-07-30	1	-2/+2
\| \| \| \| \| \| \| \| \|	Previously, the bitshift would be performed on a simple int (32 bits on most systems), overflow, and then be cast to 64 bits. CovID: 1362461 Signed-off-by: Eric Engestrom <eric@engestrom.ch> Signed-off-by: Rob Clark <robdclark@gmail.com>
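The effect of the cast position can be sketched as follows. This is an illustration only, not the freedreno code: both helper names are hypothetical, and an unsigned 32-bit operand is used so that the truncation is well defined in C (it assumes `int` is 32 bits wide, as on most systems).

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: the shift is performed in 32 bits, so the high bits
 * are already lost when the result is widened to 64 bits. */
static uint64_t widen_after_shift(uint32_t value, unsigned shift)
{
    return (uint64_t)(value << shift);
}

/* Hypothetical helper: the cast comes first, so the shift is performed
 * in 64 bits and the high bits survive. */
static uint64_t widen_before_shift(uint32_t value, unsigned shift)
{
    return (uint64_t)value << shift;
}
```

The only difference between the two helpers is where the parentheses close, which is why this kind of bug is easy to overlook in review.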
*	freedreno/a2xx: remove duplicate assignment	Eric Engestrom	2016-07-30	1	-2/+2
\| \| \| \| \| \|	CovID: 1362445, 1362446 Signed-off-by: Eric Engestrom <eric@engestrom.ch> Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: defer flush_queue allocation	Rob Clark	2016-07-30	2	-2/+4
\| \| \| \| \| \| \|	Some apps, like warsow, create a bazillion contexts but don't render on most of them. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: add some hw query traces	Rob Clark	2016-07-30	1	-0/+16
\| \| \| \|	Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: some locking	Rob Clark	2016-07-30	9	-23/+157
\| \| \| \|	Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: drop needs_rb_fbd	Rob Clark	2016-07-30	6	-31/+12
\| \| \| \| \| \| \| \| \| \| \|	We need to emit RB_FRAME_BUFFER_DIMENSION once per batch. Tracking this in fd_context is wrong when the gmem code executes asynchronously from the flush_queue worker. But in fact we don't really need to track it at all. We cannot assume the previous value at the beginning of the batch (because of other processes potentially using the GPU), so just drop the tracking and emit it in _tile_init(). Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: move needs_wfi into batch	Rob Clark	2016-07-30	19	-94/+93
\| \| \| \| \| \| \|	This is also used in gmem code, which executes from the "bottom half" (ie. from the flush_queue worker thread), so it cannot be in fd_context. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: a bit of micro-optimization	Rob Clark	2016-07-30	2	-10/+10
\| \| \| \|	Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: drop mem2gmem/gmem2mem query stages	Rob Clark	2016-07-30	2	-17/+1
\| \| \| \| \| \| \| \| \|	They weren't really used, and it gets somewhat more complicated to deal with if batches are flushed asynchronously (on another thread). So just drop them, and move _query_set_state(NULL) call into batch (so it is not happening on background thread). Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: threaded batch flush	Rob Clark	2016-07-30	9	-26/+99
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	With the state accessed from GMEM+submit factored out of fd_context and into fd_batch, now it is possible to punt this off to a helper thread. And more importantly, since there are cases where one context might force the batch-cache to flush another context's batches (ie. when there are too many in-flight batches), using a per-context helper thread keeps various different flushes for a given context serialized. TODO as with batch-cache, there are a few places where we'll need a mutex to protect critical sections, which is completely missing at the moment. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: track batch/blit types	Rob Clark	2016-07-30	5	-24/+52
\| \| \| \| \| \| \| \| \| \| \| \|	Add a bit of extra book-keeping about blits and back-blits (from resource shadowing). If the app uploads all mipmap levels, as opposed to uploading the first level and then glGenerateMipmap(), we can discard the back-blit (as opposed to being naive and shadowing the resource for each mipmap level). Also, after a normal blit, we might as well flush the batch immediately, since there is not likely to be further rendering to the surface. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: re-order support for hw queries	Rob Clark	2016-07-30	19	-264/+288
\| \| \| \| \| \| \| \| \| \| \|	Push query state down to batch, and use the resource tracking to figure out which batch(es) need to be flushed to get the query result. This means we actually need to allocate the prsc up front, before we know the size. So we have to add a special way to allocate an un-backed resource, and then later allocate the backing storage. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: use prsc for hw queries	Rob Clark	2016-07-30	3	-35/+45
\| \| \| \| \| \| \| \| \|	Switch to using a pipe_resource (rather than an fd_bo directly) for hw query result buffers. This is first step towards making queries work properly with reordered batches, since we'll need the additional dependency tracking to know which batches to flush. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: support discarding previous rendering in special cases	Rob Clark	2016-07-30	3	-5/+32
\| \| \| \| \| \| \| \| \| \|	Basically, to "DCE" blits triggered by resource shadowing, in cases where the levels are immediately completely overwritten. For example, mid-frame texture upload to level zero triggers shadowing and back-blits to the remaining levels, which are immediately overwritten by glGenerateMipmap(). Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: shadow textures if possible to avoid stall/flush	Rob Clark	2016-07-30	3	-11/+211
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	To make batch re-ordering useful, we need to be able to create shadow resources to avoid a flush/stall in transfer_map(). For example, uploading new texture contents or updating a UBO mid-batch. In these cases, we want to clone the buffer, and update the new buffer, leaving the old buffer (whose reference is held by cmdstream) as a shadow. This is done by blitting the remaining other levels (and whatever part of current level that is not discarded) from the old/shadow buffer to the new one. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: spiff up some debug traces	Rob Clark	2016-07-30	6	-6/+18
\| \| \| \| \| \| \|	Make it easier to track batches, to ensure things happen properly when they are reordered. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: add batch-cache and batch reordering	Rob Clark	2016-07-30	15	-111/+760
\| \| \| \| \| \| \| \| \| \| \| \|	Note that I originally also had an entry point that would construct a key and do lookup from a pipe_surface. I ended up not needing that (yet?) but it is easy enough to re-introduce later if we need it for the blit path. For now, not enabled by default, but can be enabled (on a3xx/a4xx) with FD_MESA_DEBUG=reorder. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: move more batch related tracking to fd_batch	Rob Clark	2016-07-30	23	-398/+420
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	To flush batches out of order, the gmem code needs to not depend on state from fd_context (since that may apply to a more recent batch). So this all moves into batch. The one exception is the gmem/pipe/tile state itself. But this is only used from gmem code (and batches are flushed serially). The alternative would be having to re-calculate GMEM layout on every batch, even if the dimensions of the render targets are the same. Note: This opens up the possibility of pushing gmem/submit into a helper thread. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: dynamically sized/growable cmd buffers	Rob Clark	2016-07-30	2	-23/+33
\| \| \| \|	Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: push resource tracking down into batch	Rob Clark	2016-07-30	7	-42/+51
\| \| \| \|	Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: introduce fd_batch	Rob Clark	2016-07-30	20	-177/+252
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Introduce the batch object, to track a batch/submit's worth of ringbuffers and other bookkeeping. In this first step, just move the ringbuffers into batch, since that is mostly uninteresting churn. For now there is just a single batch at a time. Note that one outcome of this change is that rb's are allocated/freed on each use. But the expectation is that the bo pool in libdrm_freedreno will save us the GEM bo alloc/free which was the initial reason to implement a rb pool in gallium. The purpose of the batch is to eventually facilitate out-of-order rendering, with batches associated to framebuffer state, and tracking the dependencies on other batches. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: limit non-user constant buffers to a4xx	Rob Clark	2016-07-29	1	-1/+1
\| \| \| \| \| \| \|	Seems to mostly work on a3xx. Except when it doesn't and kills gpu quite badly. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno/a4xx: time-elapsed query should be active for clears	Rob Clark	2016-07-24	1	-1/+1
\| \| \| \|	Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno/a4xx: timestamp queries	Rob Clark	2016-07-23	3	-1/+34
\| \| \| \|	Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: hw timestamp support	Rob Clark	2016-07-23	2	-2/+15
\| \| \| \| \| \|	If the kernel supports it, use hw counter for timestamps. Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno: prep work for timestamp queries	Rob Clark	2016-07-23	3	-6/+10
\| \| \| \| \| \| \| \| \|	We need "NULL" state to be a valid bit in the bitmask, because timestamp queries are not restricted to draw/etc stages (ie. the only commands to submit may just be to read the timestamp). And the absence of draws isn't a reason to skip the flush and return zero. Signed-off-by: Rob Clark <robdclark@gmail.com>
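Why the "NULL" state needs its own bit can be sketched as below. The enum names and values are hypothetical, since the commit message does not show the real definitions. The point is only that a state encoded as 0 can never match a mask test.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stage flags; names and values are illustrative only. */
enum example_stage {
    EXAMPLE_STAGE_NULL  = 1 << 0, /* no draw/clear active, but still a real bit */
    EXAMPLE_STAGE_DRAW  = 1 << 1,
    EXAMPLE_STAGE_CLEAR = 1 << 2,
};

/* A query is active in a stage if its mask includes that stage's bit.
 * If the NULL stage were 0, (mask & 0) would be 0 for every mask. */
static bool query_active(unsigned active_mask, enum example_stage stage)
{
    return (active_mask & stage) != 0;
}
```

With this encoding, a timestamp-style query can list the NULL stage in its mask and stay active when no draw has been issued.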
*	freedreno/ir3: Add missing braces in initializer	francians@gmail.com	2016-07-23	1	-1/+1
\| \| \| \|	Signed-off-by: Rob Clark <robdclark@gmail.com>
*	freedreno/a2xx: silence missing case 'SHADER_COMPUTE' warning (v2)	francians@gmail.com	2016-07-23	1	-0/+2
\| \| \| \| \| \| \|	v2: no need for break after an unreachable (Matt Turner) Signed-off-by: Francesco Ansanelli <francians@gmail.com> Signed-off-by: Rob Clark <robdclark@gmail.com>
*	gallium: split transfer_inline_write into buffer and texture callbacks	Marek Olšák	2016-07-23	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	to reduce the call indirections with u_resource_vtbl. The worst call tree you could get was: - u_transfer_inline_write_vtbl - u_default_transfer_inline_write - u_transfer_map_vtbl - driver_transfer_map - u_transfer_unmap_vtbl - driver_transfer_unmap That's 6 indirect calls. Some drivers only had 5. The goal is to have 1 indirect call for drivers that care. The resource type can be determined statically at most call sites. The new interface is: pipe_context::buffer_subdata(ctx, resource, usage, offset, size, data) pipe_context::texture_subdata(ctx, resource, level, usage, box, data, stride, layer_stride) v2: fix whitespace, correct ilo's behavior Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Acked-by: Roland Scheidegger <sroland@vmware.com>
*	gallium: add a cap for VIEWPORT_SUBPIXEL_BITS (v2)	Józef Kucia	2016-07-20	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	This allows Gallium drivers to advertise the subpixel precision for floating-point viewport bounds. v2: - Set ViewportSubpixelBits in st_init_limits. Signed-off-by: Józef Kucia <joseph.kucia@gmail.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>