Hello,
About scalable parallel sorting algorithms..
I just written before the following about my just new inventions:
-------------------------------------------------------------------
Here is my new scalable algorithms inventions..
As you know i am a white arab, and i think i am smart like genius
since i have invented "many" scalable algorithms, and my just new invention is a scalable sorting library that is so powerful, since it uses my new invention of a fully "scalable" Threadpool engine and it uses my new invention of a fully scalable merging algorithm.
--------------------------------------------------------------------
But you have to notice that in cluster computing, the challenge is
to design a highly scalable sorting algorithm that uses minimal communication and maximizes overlap between computation and communication, and uses memory efficiently, since i think in cluster computing the communication between computers in a cluster is so expensive, but since my inventions above of scalable algorithms run in Directory-based coherency in NUMA systems , so i don't have to minimize communication like in cluster computing. Here is my writing about Snooping vs. Directory-based coherency:
Performance Scalability of a Multi-core Web Server
Read more here:
https://www.cse.wustl.edu/ANCS/slides/Bryan%20Veal%20ANCS%20Presentation.pdf
As you notice above that the Address bus saturation causes
poor scaling! And the Address Bus carries requests and
responses for data, called snoops, and more caches mean more sources
and more destinations for snoops that is causing the poor scaling.
So to solve the problem of poor scalability above, you have to use
Directory-based coherence that is a mechanism to handle Cache coherence problem
in Distributed shared memory (DSM) a.k.a. Non-Uniform Memory Access (NUMA).
And you have to know that Directory-Based Cache Coherence is scalable.
Read more here about it:
http://www.cs.cmu.edu/afs/cs/academic/class/15418-s19/www/lectures/13_directory.pdf
So you have to choose Directory-Based Cache Coherence that is scalable by using NUMA systems.
And you have to know that data centers are now typically using NUMA multicore servers that provide "scalable" system performance and "cost-effective" property and that provide Directory-Based Cache Coherence that is scalable.
Read more here:
https://books.google.ca/books?id=3iy6BQAAQBAJ&pg=PA32&lpg=PA32&dq=NUMA+systems+are+cost+effective&source=bl&ots=zXHJZ7oqqW&sig=ACfU3U24MbzxiuPXJB6W6p0JCtkl9hxCHQ&hl=en&sa=X&ved=2ahUKEwjX9o_gubTqAhVBc98KHU0WDJ8Q6AEwCnoECAgQAQ#v=onepage&q=NUMA%20systems%20are%20cost%20effective&f=false
Thank you,
Amine Moulay Ramdane.