haiwan/ck - ck - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Samy Al Bahra	41ec076372	ck_ht: Add ck_ht_entry_set_key_direct function.	13 years ago
Samy Al Bahra	fb5d9331e2	ck_ht: Add ck_ht_entry_empty function. Function returns true if the initialized ck_ht_entry_t pointed to by its argument does not contain a key-value pair.	13 years ago
Samy Al Bahra	a7031bf938	ck_malloc: Add stdbool.h include.	13 years ago
Samy Al Bahra	2d37b6f828	ck_malloc: Add shared structure for representing data structure-specific allocation functions. This structure is due to change soon.	13 years ago
Samy Al Bahra	9f786337f7	ck_ht: Lock-free SPMC hash table, for x86_64. This is a hash table that is optimized for architectures that implement total store ordering and workloads that are read-heavy involving a single writer and multiple readers. Unlike traditional non-blocking multi-producer/multi-consumer hash table implementations this version allows for immediate re-use of deleted buckets (no need for explicit reclamation cycles) and is more conducive to traditional safe memory reclamation schemes used in unmanaged languages (otherwise, we would require key duplication). It is relatively heavy-weight for MPMC workloads on architectures which do not implement TSO in comparison to Click's MPMC hash table. However, it still has better performance characteristics than a blocking hash table. The committed version currently only provides x86_64 support. This is being committed for review by peers and for a silent release that will allow us to test ck_ht_spmc under high production workloads. Next public release will include additional documentation as well as support for other architectures. In the mean time, please see the unit tests for example usage. Included in this commit: Dropped -Wbad-function-cast from GCC port.	13 years ago
Samy Al Bahra	4fc1717975	ck_epoch: Use full barrier on read begin and load barrier on read end. We shouldn't offload the responsibility of the read_begin flush for shared data mutations to the user. read_end requires a load barrier at the least, not a store barrier.	13 years ago
Samy Al Bahra	674e69f259	ck_queue: Add BSD-derived queue.h facility. Writer-side synchronization is still necessary. My current use-cases call for SLIST and LIST implementations, and as such, I've only implemented support for these. TAILQ facilities will be developed when the time comes that I require them or if there is sufficient user-demand.	13 years ago
Samy Al Bahra	f5f5074b70	ck_pr: Cast down to void pointer for cmpxchg16b wrapper.	13 years ago
Samy Al Bahra	eae4a518a8	ck_epoch: Differentiate read/write epoch endings. ck_epoch_end is deprecated. Please use ck_epoch_read_end and ck_epoch_write_end instead.	13 years ago
Samy Al Bahra	17f69d6c0d	ck_md: Parenthesize CK_MD_CACHELINE.	14 years ago
David Joseph	0123c454f4	ck_pr: Add support for atomic ops for doubles. Implemented add, sub, dec, inc, neg, faa and fas in terms of CAS.	14 years ago
Samy Al Bahra	1cefea7eb6	ck_spinlock: Provide a default implementation. Several users in the past have noted it was difficult for them to decide what spinlock implementation to use. In light of this, a light-weight greedy default is chosen (currently ck_spinlock_fas).	14 years ago
Samy Al Bahra	50f6f6ee02	ck_rwlock: Add ck_rwlock_write_downgrade. This function allows the writer to downgrade to shared access atomically.	14 years ago
Samy Al Bahra	0231b68a8b	ck_spinlock: Add ck_spinlock_*_locked. This class of functions detects whether or not the lock is currently acquired.	14 years ago
Samy Al Bahra	e8a96f4fb8	ck_pr: Add ck_pr_fas_double for {ppc,x86_}64 and ck_pr_cas_double{_2} for x86_64.	14 years ago
Samy Al Bahra	5e7adf5983	ck_pr: Add respective feature flags for ck_pr_load/store_double.	14 years ago
William Irwin	41ef93744a	ck_pr: Add ck_pr_load_double()/ck_pr_store_double() in case volatile loads and stores of floating point values are needed.	14 years ago
Samy Al Bahra	851aaf8dd6	ck_brlock: Allow for recursive reader lock.	14 years ago
Samy Al Bahra	c03dd7a45c	ck_epoch: Update peak value on retire.	14 years ago
Samy Al Bahra	cc4b83793f	ck_epoch: Flip arguments, specify epoch_entry before destructor.	14 years ago
Samy Al Bahra	79c7e453ec	ck_epoch: Specify destructor in ck_epoch_retire.	14 years ago
Samy Al Bahra	10ffb2e6f1	ck_epoch: Support per-object destructors. This increases epoch per-object overhead to 16 bytes.	14 years ago
Samy Al Bahra	0f48b6fe7a	ck_proxy: Add support for recursive epoch sections. Note that the global epoch won't tick over until the parent epoch section has completed.	14 years ago
Samy Al Bahra	b5755888fc	ck_rwlock: Add trylock variants with user-defined spin factor.	14 years ago
Samy Al Bahra	9feb93758f	ck_brlock: Add trylock variants with user-defined spin factor.	14 years ago
Samy Al Bahra	8b0e83e6ab	ck_brlock: No need for explicit pipeline stall if writer is active.	14 years ago
Samy Al Bahra	4e7c6ee270	Add full barrier for anderson spinlock.	14 years ago
Samy Al Bahra	5889498c16	ck_ring: Add modulo logic to ck_ring_size and CK_RING_SIZE.	14 years ago
Samy Al Bahra	d3f0a634ab	ck_fifo: Add a mechanism to extract the spare node from the SPSC FIFO.	14 years ago
Samy Al Bahra	6b627d9aad	ck_fifo: Add trylock interface to dequeue.	14 years ago
Samy Al Bahra	5b37b97f7e	ck_fifo: Add fifo trylock variant.	14 years ago
Samy Al Bahra	76411d5d63	ck_fifo: Add volatile variation of ISEMPTY, isempty.	14 years ago
Samy Al Bahra	c7a12c7ac2	ck_fifo: Add a simple spinlock interface for ck_fifo_spsc.	14 years ago
Samy Al Bahra	16c86b98e5	ck_rwlock: Add publish semantics for rwlock.	14 years ago
Samy Al Bahra	88ee328b78	ck_ring: Add CK_RING_SIZE/CAPACITY and ck_ring_size/capacity. ck_ring_size will return the current size of the ring while ck_ring_capacity will return the capacity of the specified ring.	14 years ago
Samy Al Bahra	006f58edcb	ck_rwlock: Add a naive rw spinlock after many requests.	14 years ago
Samy Al Bahra	786efb9594	ck_brlock: Add big reader spinlocks.	14 years ago
Samy Al Bahra	631d5f93f4	ck_pr/sparcv9: Recent SPARCs actually implement RSO. ck_pr_fence_store/load will enforce store/load barriers by default from now on.	14 years ago
sbahra	49a2820000	Added support for Sun Studio 12 C compiler. build: - configure step will generate relevant CFLAGS. - build profiles are for convenience (developers can use themu for cross-compilation). regressions: - Renamed ck_barrier unit tests to work-around behavior of Solaris linker. - Adopted use of a PTHREAD_CFLAGS variable. ck_cc: - Added internal CK_CC_IMM macro for compilers that are verbose against impossible inline constraints (or limited optimizers). ck_pr/x86*: - Adopted CK_CC_IMM macro. - Dropped redundant constraints. This work was mostly completed by Theo Schlossnagle <jesus@omniti.com>, much thanks to him. He has also provided access to a machine with Sun Studio 12.	14 years ago
Samy Al Bahra	38c614222a	ck_spinlock: Avoid modulus on unlock for power of 2 count.	14 years ago
Samy Al Bahra	69e5c56acb	Merge branch 'master' of git.concurrencykit.org:ck	14 years ago
Samy Al Bahra	f6a2cb1b39	ck_pr/x86_32: Drop 64-bit operations. We just don't care enough about these right now. The real issue is supporting PIC environments. This likely requires stand-alone assembly blobs, something we don't want to deal with right now.	14 years ago
Samy Al Bahra	d761291ab1	ck_pr: Alphabetically order includes.	14 years ago
Samy Al Bahra	3f87223d21	ck_epoch: Collect from previous instance of epoch. Add torture test. This typo lead to incorrect results. Added a read-mostly torture test (follow on work is volatile interface for stack).	14 years ago
Samy Al Bahra	13dd1a4f82	ck_epoch: Update reader-side.	14 years ago
Samy Al Bahra	70860736f6	ck_epoch: Update epoch on read. It's safe to update thread epoch on entry to a read-side section. Without this, it will be very easy to starve writers of reclamation.	14 years ago
Samy Al Bahra	cbe38a9999	ck_epoch: Remove whitespace.	14 years ago
Samy Al Bahra	20fb7a9200	ck_hp: Match epoch semantics. Added improved observability and a ck_hp_purge. The rename matches the naming used in ck_epoch. Documentation updates to follow through soon.	14 years ago
Samy Al Bahra	a72e86e0ba	ck_epoch: Follow-up to previous commit. Some changes were not checked in.	14 years ago
Samy Al Bahra	83f1436f84	ck_epoch: Redesigned and improved unit test and observability. ck_epoch_reclaim is now the replacement for ck_epoch_flush. ck_epoch_purge guarantees that all entries are reclaimed for the provided record before exiting. n_peak counter has been added, which provides the peak number of items across all reclamation lists. n_reclamations provides the number of reclamations across the lifetime of the record. These are cleared on unregister. ck_epoch_update has been renamed to ck_epoch_tick. Hazardous sections which mutate shared structures are now expected to begin with ck_epoch_write_begin and end with ck_epoch_end. Hazardous sections which read shared structures are now expected to begin with ck_epoch_read_begin and end with ck_epoch_end. ck_hp_free is now more aggressive. It will attempt a reclamation cycle any time the pending count is long. I should probably add a ck_hp_retire to have a version which allows for bulk updates to local reclamation lists.	14 years ago
Samy Al Bahra	492faed9a3	Reformatting changes for my new laptop.	14 years ago
Samy Al Bahra	ba11d1e579	ck_sequence: Match up types. Do not use current. It clashes with current in the Linux kernel.	14 years ago
Samy Al Bahra	db9e07625a	ck_hp_fifo: Don't back-off when forwarding pointer.	14 years ago
Samy Al Bahra	ad4b577200	ck_hp_fifo: Store correct value in pointer.	14 years ago
Samy Al Bahra	8c708da8e8	ck_epoch: Use volatile store when updating local epoch.	14 years ago
Samy Al Bahra	7bd5259505	ck_fifo: MPMC variant will return "garbage" pointer which user can re-use.	14 years ago
Samy Al Bahra	158e1580f5	ck_hp_fifo: Fix broken build.	14 years ago
Samy Al Bahra	5180a6fb36	ck_hp_fifo: Add more fences. Add backoff.	14 years ago
Samy Al Bahra	beafb7d78e	ck_fifo: Add back-off and remove recycle. Recycle will just be a bottleneck. The MPMC interface should instead return a junk pointer and allow the user to manage its lifetime in a way they see fit.	14 years ago
Samy Al Bahra	d7d1dfbf50	ck_hp: Remove barrier from set and allow user to batch. A user may want to batch multiple slot updates. Enforcing strong semantics in set would not allow this.	14 years ago
Samy Al Bahra	dffbb4b48f	ck_hp: Add explicit store fence after setting slot.	14 years ago
Samy Al Bahra	2302155613	ck_epoch: Reference Fraser's thesis.	14 years ago
Samy Al Bahra	f22bddddd5	ck_epoch: Place epoch on a separate cache line.	14 years ago
Samy Al Bahra	826d9996ac	ck_epoch: Remove unnecessary header file.	14 years ago
Samy Al Bahra	7b8dfe44be	ck_hp: Rename subscribe to register.	14 years ago
Samy Al Bahra	b123ec2313	ck_epoch: Whitespace changes. Add ck_epoch_recycle.	14 years ago
Samy Al Bahra	875d070814	ck_epoch: Remove debug output, update comments.	14 years ago
Samy Al Bahra	386f3647cb	x86_64: Remove workaround for Opteron errata, other minor clean-up. There is a bug first generation AMD Opteron processors' with cpuid family 0Fh and models less than 40h when it comes to read-modify write operations after load/store sequence. Not worth supporting this processor. If you are on this processor, you can find more information at: http://bugzilla.kernel.org/show_bug.cgi?id=11305#c2	14 years ago
Samy Al Bahra	b882517d5e	PPC64: Complete port, add binary write-only operations.	14 years ago
Samy Al Bahra	fb25458121	ck_barrier: Clean up tournament barriers.	14 years ago
Samy Al Bahra	8b4f72057c	ck_barrier: First round audit, tournament barriers next. The barriers have been restructured into individual file per implementation. Some micro-optimizations were implemented for some barriers (caching common computations in the barrier). State subsription is now explicit with the TID counter allocated on a per-barrier basis. Tournament barriers remaining and then another round will be done for correctness and algorithmic improvements.	14 years ago
Samy Al Bahra	25f1fde7fa	PPC64: Add fetch-and-add.	14 years ago
Samy Al Bahra	f48a0c2480	PPC64: Add unary operations.	14 years ago
Samy Al Bahra	cf4ee8c7a4	Merge branch 'master' of ssh://git.repnop.org/public/ck	14 years ago
Samy Al Bahra	5f2f69eebb	Work-around strict aliasing issue.	14 years ago
David Joseph	dff69e639d	Merge branch 'master' of ssh://git.repnop.org/public/ck	14 years ago
David Joseph	64f6702a4c	Implemented tournament and mcs barriers. These are the tournament and mcs barriers from "Algorithms for Scalable Synchronization on Shared-Memory Multiprocessors." Validation tests have also been added for these barriers to regressions/ck_barrier/validate.	14 years ago
Samy Al Bahra	24abb2a3ac	Merge branch 'master' of ssh://git.repnop.org/public/ck	14 years ago
Samy Al Bahra	90fee0d839	PPC64: Import minimal PPC64 port. Must implement templates now for phi in LL-phi-SC templates to allow for lower latency fetch-and-phi operations.	14 years ago
Samy Al Bahra	a29a1c2a8c	sparcv9: Whitespace change.	14 years ago
Samy Al Bahra	133e936744	sparcv9: Make use of CK_PR_FENCE macro.	14 years ago
Samy Al Bahra	e1d33c467b	ck_pr: Fix some strict aliasing issues and fix char neg.	14 years ago
David Joseph	480db1321c	Implemented dissemination barriers. Validation (ck_barrier_dissemination.c) is included.	14 years ago
Devon H. O'Dell	e5a5d0e2b9	x86: Remove invalid comment about fixed bug.	14 years ago
Devon H. O'Dell	ace2b787f5	x86: Implement ck_pr_load_32_2 in terms of movq ck_pr_load_32_2 (and thus ck_pr_load_ptr_2) were previously implemented in terms of lock cmpxchg8b, which is considerably slower than just using movq. Relevant tests making use of load_ptr_2 still pass, so I'm confident this change is correct.	14 years ago
Samy Al Bahra	ad85634188	Enable x86_32 port.	14 years ago
Devon H. O'Dell	ccf002223c	x86: fix btX functions C casts to unsigned int by default, so we were experiencing some negative undefined behavior in the 1 << 31 case. x86 now works; bts and btc are both passing.	14 years ago
Devon H. O'Dell	0c8c054c0a	x86: More PIC happiness It turns out that the "p" constraint doesn't work on clang, so we have to get rid of that. This means that we may need to require GCC 4.3+ if it turns out that GCC 4.1 / 4.2 still run out of registers compiling this version.	14 years ago
Samy Al Bahra	7c8ab13343	Some fixes for strict aliasing. Silence some warnings from clang.	14 years ago
Devon H. O'Dell	114e9c8ed5	x86: Fix typo missed in last merge somehow	14 years ago
Samy Al Bahra	5900adb424	Move ck_barrier.h into shared/static object. Add CK_CC_CACHELINE. There is absolutely no benefit to inlining these. Additionally, explicitly set the alignment of a ck_barrier_combining_group to a cacheline.	14 years ago
David Joseph	9ab7b7ae42	Fixed a bug with ck_barrier_combining_aux. Since groups are now implemented, leaves can no longer skip to their parents.	14 years ago
David Joseph	615e3ca47f	Merge branch 'master' of ssh://git.repnop.org/public/ck	14 years ago
David Joseph	5e7073b9b7	Software tree combining barrier now supports an arbitrary number of threads per group. Changed some structure names for ck_barrier_combining. Modified unit tests to account for groups of threads.	14 years ago
Samy Al Bahra	a462c89787	ck_sequence: Remove unnecessary load fence from seqlock.	14 years ago
Samy Al Bahra	3b63432d03	Merge branch 'master' of ssh://git.repnop.org/public/ck	14 years ago
Samy Al Bahra	09bfb13762	ck_barrier: Minor changes, reorganization. Switched to using fas lock. Reorganized to match existing style.	14 years ago
Devon H. O'Dell	22142de58f	Merge branch 'master' of ssh://git.repnop.org/public/ck	14 years ago
Devon H. O'Dell	18e1c646fa	x86: Fix buglets Fix a typo that was causing several validation tests to hang. (Doing cmpxchg8b (%eax) isn't going to work very well.) I am wondering if something is wrong with the general implementation of ck_pr_bts_64 and ck_pr_btc_64 because it's pretty clear that with the stack tests passing, ck_pr_cas_32_2_value works fine.	14 years ago
David Joseph	ba3c908fc5	Merge branch 'master' of ssh://git.repnop.org/public/ck	14 years ago
David Joseph	8c24a60866	Implemented ck_barrier_combining. This is the software combining tree barrier from the MCS paper. Currently, it uses a binary tree; it may be changed later to use an n-ary tree. Validation (combining_validation.c in regressions/ck_barrier/validate) has also been added.	14 years ago
Devon H. O'Dell	000eb80099	x86: Make things friendlier for PIC and non-PIC builds Making things work properly with PIC on 32-bit x86 architectures is tricky because of our lack of %ebx. Additionally, GCC versions < 4.3 have some problems determining what registers may be reused, causing some of the inline assembly constraints to be a little counterintuitive. (Thanks to Ian Lance Taylor for the suggestion to get around the reuse issues.) This change makes us use sane assembler in cases where we're running non-PIC and use the heavyweight versions only for PIC. There may still be some issues in this code; for example, it's apparent that 64-bit btc and bts intrinsic atomics are broken in the version of GCC I'm using, so those will have to be implemented. Additionally, the ck_stack tests currently don't work with fPIC (not sure if that's the fault of the tests or the port). Everything does pass now in non-PIC, excluding btc/bts tests (in my current environment).	14 years ago
Devon H. O'Dell	8818191ec0	x86: additional changes for support on 32-bit archs Make the assumption that all of our 32-bit x86 architecture targets have SSE and SSE2. This allows us to use MOVQ, which is nicer than using cmpxchg8b for loads / stores. Fix up some of the CAS stuff for -fPIC. This isn't entirely done, and at least ck_fifo_mpmc hangs with this code. Not entirely sure why.	14 years ago
Samy Al Bahra	66ebe1a874	Drop usage of CK_CC_PACKED, prefer natural alignment. CK_CC_PACKED will drop structures to one-byte alignment in certain cases. Obviously, this will mean bad performance on most architectures. Thanks to Matt Johnson from https://rigel.crhc.illinois.edu/ for reporting this problem.	14 years ago
Samy Al Bahra	b5680c42d8	Add sense-reversing centralized barrier. David is interested in implementing ck_barrier.h. These can serve as skeletons.	14 years ago
Samy Al Bahra	d3a033237a	Avoid modulus for wrap-around calculation in ck_anderson. This change has shown a 32% reduction in latency for the reference SPARCv9 platform.	14 years ago
David Joseph	2ff74dd51b	Slight modification to ck_pr.h. Added #undefs for CK_PR_N_Z and CK_PR_N_Z_S.	14 years ago
Samy Al Bahra	3c5b10722e	Alphabetically order includes due to OCD.	14 years ago
David Joseph	041c5c8265	Added support for fixed width uints for all primitives in ck_pr.h. Added NEG_ZERO to ck_pr.h.	14 years ago
Samy Al Bahra	b0f812296e	Add feature flag for ck_pr_cas_32_value on SPARCv9.	14 years ago
Samy Al Bahra	a915d3f9ad	Add SLOT constants for ck_hp_stack.h.	14 years ago
Samy Al Bahra	809089d623	Revert "Commit additional x86 changes to make things work." This reverts commit `0e548375f5`.	14 years ago
Devon H. O'Dell	e7fd074b70	Merge branch 'master' of ssh://git.repnop.org/public/ck	14 years ago
Devon H. O'Dell	0e548375f5	Commit additional x86 changes to make things work. I'm tempted to conglomorate sexual deviances with Linus and git right now that are not entirely appropriate. I'm unimpressed.	14 years ago
Samy Al Bahra	aa9f9b0892	Use ck_stdint.h rather than stdint.h, always.	14 years ago
Devon H. O'Dell	d4c9641c83	API: add atomics for 32-bit x86 architectures Add APIs for doing atomic CAS/load/store/etc on 32-bit platforms. In some cases this also includes operations on 64-bit integers using cmpxchg8b. It is possible we could do some additional stuff on larger integers using SSE, but the goal of this port is to target i586/k5 and newer processors. Samy mentions that we may want to do something or another for making portability easier, but I wasn't paying tons of attention when he was talking, so I forget what that was all about. Plus it's funny to write in a commit message. Haven't done a full run-through of validate / benchmark tests, but a cursory runthrough seems to indicate all passing.	14 years ago
Samy Al Bahra	bcaadcf094	Factor out some common utility functions. Moved rdtsc and affinity logic to a single file which other regression tests use. Single point of reference will ease porting these to future architectures and platforms. Removed invalid Copyright statement. Added CK_CC_USED to force some code generation that I found useful for debugging. Added ck_stack latency tests and a modified version of djoseph's modifications to benchmark.h for spinlock latency tests.	14 years ago
Samy Al Bahra	cbd30b2206	Initial import.	14 years ago

... 10 11 12 13 14

668 Commits (master)