1. For ck_pr_cas_foo_value, let the inline assembly save the observed
value in a register, and store to the output reference in C.
This lets the C optimiser eliminate the memory access once the
CAS function is inlined.
2. Report the result of the CAS as a condition code in EFLAGS (a flag
output operand) instead of executing SETcc in inline assembly, when
possible. GCC gained this functionality in GCC 6; CAS loops can now
branch directly on the condition code, without a SETcc / TEST pair
(see the sketch below).
TESTED=existing regression tests.
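
For illustration, a minimal sketch of both points for the 32-bit case,
assuming GCC 6+ flag outputs; this is not the exact ck macro expansion,
and the function name is made up for the example:

#include <stdbool.h>
#include <stdint.h>

static inline bool
example_cas_32_value(uint32_t *target, uint32_t compare, uint32_t set,
    uint32_t *original)
{
    bool z;

    /* Success is reported through the ZF flag output ("=@ccz"); the
     * observed value comes back in %eax via the "+a" operand. */
    __asm__ __volatile__("lock cmpxchgl %3, %0;"
        : "+m" (*target), "=@ccz" (z), "+a" (compare)
        : "r" (set)
        : "memory");

    /* Plain C store of the observed value: once the function is
     * inlined, the optimizer can drop this store (and the register
     * copy) when the caller never reads *original. */
    *original = compare;
    return z;
}

Since z comes straight out of ZF, a CAS loop built on this can compile
down to lock cmpxchg followed by a conditional branch, with no
intervening SETcc / TEST.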
* Implement ck_pr_dec_is_zero family of functions
* include/ck_pr.h: add ck_pr_{dec,inc}_is_zero and implement
ck_pr_{dec,inc}_zero in terms of the new functions (see the sketch
after this list). Convert the architecture-specific implementations
of ck_pr_foo_zero for x86 and x86-64 to ck_pr_foo_is_zero.
* regressions/ck_pr/validate: add smoke tests for ck_pr_dec_{,is_}zero and ck_pr_inc_{,is_}zero
* doc: document ck_pr_inc_is_zero
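
As a rough sketch of the relationship (the example_ names are made up,
and the x86/x86-64 ports use dedicated inline assembly rather than this
generic form): _is_zero returns the result directly, so callers can
branch on it, and the older _zero interface is expressed in terms of it.

#include <stdbool.h>
#include <stdint.h>
#include <ck_pr.h>

CK_CC_INLINE static bool
example_dec_32_is_zero(uint32_t *target)
{

    /* The decrement hits zero iff the previous value was 1. */
    return ck_pr_faa_32(target, (uint32_t)-1) == 1;
}

CK_CC_INLINE static void
example_dec_32_zero(uint32_t *target, bool *zero)
{

    *zero = example_dec_32_is_zero(target);
    return;
}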
These primitives are meant to be used in lock implementations
where control dependency ordering is sufficient to enforce
ordering of critical sections. At the moment, this only affects
PPC. Currently, we rely on lwsync for entry into critical sections,
which is insufficient. sync is rather heavy-weight, and assuming
we aren't falling victim to compiler re-ordering, isync should
be sufficient.
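
Roughly, the idiom in question looks like the following sketch
(illustrative only, using a made-up helper and the __atomic builtins
rather than ck's primitives):

static inline void
example_spin_lock(unsigned int *lock)
{

    /* The while-loop branch is the control dependency: it consumes
     * the value returned by the atomic operation that acquires the
     * lock. */
    while (__atomic_exchange_n(lock, 1, __ATOMIC_RELAXED) != 0)
        continue;

#if defined(__powerpc__) || defined(__powerpc64__)
    /* isync keeps subsequent instructions (and thus the critical
     * section's loads and stores) from executing before the branch
     * above has resolved. */
    __asm__ __volatile__("isync" ::: "memory");
#endif
}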
There is follow-up work to be done on ARM, as we may have cheaper
(but target-specialized) ISB tricks for load-load ordering.
We use some macro trickery to enforce that ck_pr_store_* is actually
storing the correct type into the target variable, without any actual
side effects: the assignment appears only as an rvalue inside a comma
expression, so the compiler should optimize it away.
On the load side, we simply cast the result to the type of the target
variable for pointer loads.
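
A sketch of the kind of macros involved (the EXAMPLE_ names are made
up and the real definitions differ; here the dead assignment is hidden
under sizeof, so it is never evaluated at all), forwarding to the
unsafe store described below:

#define EXAMPLE_STORE_PTR(target, value)                \
    ((void)sizeof(*(target) = (value)),                 \
        ck_pr_store_ptr_unsafe((target), (value)))

/* Load side: example_md_load_ptr stands in for the underlying,
 * untyped pointer load. */
#define EXAMPLE_LOAD_PTR(target)                        \
    ((__typeof__(*(target)))example_md_load_ptr(target))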
There is an unsafe version of the store_ptr macro called
ck_pr_store_ptr_unsafe for those times when you are _really_ sure that
you know what you're doing.
This commit also updates some of the source files (ck_ht, ck_hs,
ck_rhs): ck_ht now uses the unsafe macro, as its conversion between
uintptr_t and void * is invalid under the new macros. ck_hs and ck_rhs
have had some casts added to preserve validity.
These add unnecessary complexity to the ck_pr_fence interface.
Instead, it can be safely assumed that developers will use
ck_pr_fence_X to enforce X -> X ordering.
These operations serialize atomic-RMW operations with respect
to each other, as well as to loads and stores. In addition to this,
the load_depends implementations have been removed.
ck_pr_fence_{load_load,store_store,load_store,store_load} operations
have been added. In addition to this, it is no longer the responsibility
of architecture ports to determine when to emit a specific fence. Instead,
the underlying port will always emit the necessary instructions to
enforce strict ordering. The higher-level include/ck_pr implementation
decides whether a fence actually needs to be emitted, according to
the memory model specified by ck_md (CK_MD_{TSO,RMO,PSO}).
In other words, only ck_pr_fence_strict_* is implemented by the MD-ck_pr
port.
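
Schematically, and with made-up macro names, the split looks like
this: the MD port always supplies ck_pr_fence_strict_*, and the
generic layer either forwards to it or degrades to a compiler barrier,
depending on the configured memory model.

#define EXAMPLE_FENCE_EMIT(T)               \
    CK_CC_INLINE static void                \
    ck_pr_fence_##T(void)                   \
    {                                       \
        ck_pr_fence_strict_##T();           \
        return;                             \
    }
#define EXAMPLE_FENCE_NOOP(T)               \
    CK_CC_INLINE static void                \
    ck_pr_fence_##T(void)                   \
    {                                       \
        ck_pr_barrier();                    \
        return;                             \
    }

#if defined(CK_MD_RMO) || defined(CK_MD_PSO)
/* Weaker models may reorder stores, so the real fence is emitted. */
EXAMPLE_FENCE_EMIT(store_store)
#else
/* TSO already orders store -> store; a compiler barrier suffices. */
EXAMPLE_FENCE_NOOP(store_store)
#endif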
build:
- configure step will generate relevant CFLAGS.
- build profiles are for convenience (developers can use them
for cross-compilation).
regressions:
- Renamed ck_barrier unit tests to work around behavior
of the Solaris linker.
- Adopted use of a PTHREAD_CFLAGS variable.
ck_cc:
- Added internal CK_CC_IMM macro for compilers that complain
about impossible inline assembly constraints (or have limited
optimizers).
ck_pr/x86*:
- Adopted CK_CC_IMM macro.
- Dropped redundant constraints.
This work was mostly completed by Theo Schlossnagle
<jesus@omniti.com>, much thanks to him. He has
also provided access to a machine with Sun Studio 12.
ck_pr_load_32_2 (and thus ck_pr_load_ptr_2) were previously implemented in
terms of lock cmpxchg8b, which is considerably slower than just using movq.
Relevant tests making use of load_ptr_2 still pass, so I'm confident this
change is correct.
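
For reference, a sketch of the movq form (illustrative, assuming SSE2
is available): a single 64-bit move through an XMM register is an
atomic load on this class of hardware, with no lock prefix and no
cmpxchg8b round trip.

#include <stdint.h>

static inline void
example_load_64(const uint64_t *target, uint64_t *value)
{

    __asm__ __volatile__("movq %1, %%xmm0;"
                         "movq %%xmm0, %0;"
        : "=m" (*value)
        : "m"  (*target)
        : "memory", "xmm0");
    return;
}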
It turns out that the "p" constraint doesn't work on clang, so we have to get
rid of that. This means that we may need to require GCC 4.3+ if it turns out
that GCC 4.1 / 4.2 still run out of registers compiling this version.
Fix a typo that was causing several validation tests to hang.
(Doing cmpxchg8b (%eax) isn't going to work very well, since
cmpxchg8b already uses %edx:%eax as the comparand.) I am
wondering if something is wrong with the general implementation
of ck_pr_bts_64 and ck_pr_btc_64, because it's pretty clear from
the stack tests passing that ck_pr_cas_32_2_value works fine.
Making things work properly with PIC on 32-bit x86 architectures is tricky
because of our lack of %ebx. Additionally, GCC versions < 4.3 have some
problems determining what registers may be reused, causing some of the inline
assembly constraints to be a little counterintuitive. (Thanks to Ian Lance
Taylor for the suggestion to get around the reuse issues.)
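
A sketch of the kind of workaround involved (illustrative; the real
macros differ in detail): under -fPIC on 32-bit x86, %ebx holds the
GOT pointer and cannot appear in the clobber list, so the low word of
the new value is swapped into %ebx around the cmpxchg8b, and the
pointer is kept in %edi so the memory operand never needs
%ebx-relative addressing.

#include <stdbool.h>
#include <stdint.h>

static inline bool
example_cas_64_pic(uint64_t *target, uint64_t compare, uint64_t set)
{
    bool z;

    __asm__ __volatile__("xchgl %%esi, %%ebx;"
                         "lock cmpxchg8b (%%edi);"
                         "xchgl %%esi, %%ebx;"  /* restore %ebx */
                         "setz %0;"
        : "=qm" (z),
          "+A"  (compare)                 /* compare in %edx:%eax */
        : "S"   ((uint32_t)set),          /* low word of set */
          "c"   ((uint32_t)(set >> 32)),  /* high word of set */
          "D"   (target)
        : "memory", "cc");
    return z;
}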
This change makes us use sane assembler in cases where we're running
non-PIC and use the heavyweight versions only for PIC. There may still be
some issues in this code; for example, it's apparent that 64-bit btc and
bts intrinsic atomics are broken in the version of GCC I'm using, so those
will have to be implemented.
Additionally, the ck_stack tests currently don't work with -fPIC (not sure
if that's the fault of the tests or the port). Everything does pass now in
non-PIC, excluding btc/bts tests (in my current environment).
Make the assumption that all of our 32-bit x86 architecture targets
have SSE and SSE2. This allows us to use MOVQ, which is nicer than
using cmpxchg8b for loads / stores.
Fix up some of the CAS stuff for -fPIC. This isn't entirely done,
and at least ck_fifo_mpmc hangs with this code. Not entirely sure
why.
Add APIs for doing atomic CAS/load/store/etc on 32-bit platforms. In
some cases this also includes operations on 64-bit integers using
cmpxchg8b. It is possible we could do some additional stuff on larger
integers using SSE, but the goal of this port is to target i586/k5 and
newer processors.
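
As a sketch of the cmpxchg8b approach for the store case
(illustrative, non-PIC): without a plain 64-bit store instruction, an
atomic 64-bit store on i586-class hardware can be built as a
compare-and-swap loop that retries until it observes the value it is
replacing.

#include <stdint.h>

static inline void
example_store_64(uint64_t *target, uint64_t value)
{

    /* cmpxchg8b compares %edx:%eax with the memory operand; on
     * failure it reloads %edx:%eax with the current contents, so the
     * loop simply retries until the exchange succeeds. */
    __asm__ __volatile__("movl (%0), %%eax;"
                         "movl 4(%0), %%edx;"
                         "1: lock cmpxchg8b (%0);"
                         "jne 1b;"
        :
        : "D" (target),
          "b" ((uint32_t)value),
          "c" ((uint32_t)(value >> 32))
        : "eax", "edx", "memory", "cc");
    return;
}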
Samy mentions that we may want to do something or other to make
portability easier, but I wasn't paying tons of attention when he was
talking, so I forget what that was all about. Plus it's funny to write
in a commit message.
Haven't done a full run-through of the validate / benchmark tests, but
a cursory run suggests everything passes.