Re: Parrot, threads & fears for the future.

Replies are listed 'Best First'.

Re^2: Parrot, threads & fears for the future.
by Corion (Patriarch) on Oct 23, 2006 at 12:16 UTC

You stopped reading too soon, then. There is much parallelism which could be exploited by having (say) Perl6 constructs be natively parallel, for example the hyperoperators, and grep/map.

Also, having real, painless userspace "threads" (i.e. cooperative multitasking) would be a huge benefit too.

Your reaction is exactly the point that BrowserUk is trying to make, or at least how I interpret it - actually and transparently using and supporting threads is an important asset for a programming language. Having to manually implement IPC, like you have to do if you want to keep on using fork and still reap the benefits of parallelism is a pattern and hence a weakness in the language.

[reply]
[d/l]

Re^3: Parrot, threads & fears for the future.

by Jenda (Abbot) on Oct 23, 2006 at 13:10 UTC

You can only transparently paralelize map{} if the executed block is side-effect-free. Perl is far too dynamic for the compiler/optimizer to safely check that and you might very well want to have side-effects in your map{}s so you'd have to have two map{}s, a paralellisable one and a garanteed-to-be-serial one. It's one thing to paralelize a purely functional, side-effect free language and to paralelize something that may do whatever it bloody wishes. Everything comes for a prize, even freedom.

Jenda
Support Denmark!
Defend the free world!

[reply]

Re^4: Parrot, threads & fears for the future.

by Corion (Patriarch) on Oct 23, 2006 at 13:11 UTC

Sure - but I'm not talking about doing that for Perl5, but postulating it for Perl6, where such stuff is still possible.

[reply]

Re^4: Parrot, threads & fears for the future.

by tphyahoo (Vicar) on Oct 23, 2006 at 15:10 UTC

I think that's exactly what I'm seeking to do in Using functional programming to reduce the pain of parallel-execution programming (with threads, forks, or name your poison)

Here "hashmap" would appear to be the parallelisable function, which is (I hope) side effect free. I have a similar function I'm playing with (not mentioned in the post) which I'm calling vector_reduce. This can't be parallelised.

The two functions together seem to be the conceptual core, or something close by, of MapReduce, made famous by google, and newly on cpan.

[reply]

Re^4: Parrot, threads & fears for the future.

by AK108 (Friar) on Oct 25, 2006 at 23:43 UTC

use parallel

integer

my @list = ('aaaa' .. 'zzzz');
use parallel 'map'; # Just for maps
my @ordlist = map ord, @list;

# Defaults only
# Probably something like any, all, map, grep, and maybe even sort
use parallel;

# Sort by last letter in parallel
@list = sort {($a =~ /\w+(\w)/) <=> ($b =~ /\w+(\w)/)} @list;

# This subroutine can operate in parallel
use parallel 'foreach';
foo($_) foreach @list;

no parallel; # disable everywhere else
[download]

[reply]
[d/l]

Re^5: Parrot, threads & fears for the future.

by Jenda (Abbot) on Oct 26, 2006 at 10:34 UTC

Re^6: Parrot, threads & fears for the future.

by AK108 (Friar) on Oct 26, 2006 at 18:08 UTC

Some notes below your chosen depth have not been shown here

Re^2: Parrot, threads & fears for the future.
by shmem (Chancellor) on Oct 23, 2006 at 12:20 UTC

In unix, thread is spelled "f-o-r-k".

That isn't true even stripping "f-o-r-k" of the flame inducing hyphens, and you know that. fork() means fully cloning a process, proper with environment, file handles, slot in the process table and all mumbo, with no further sharing of resources between the two fully-fledged processes save the mechanisms suitable for any inter-communication of processes (IPC), whereas threads... well, you know. Or you don't. Sad you stopped reading there, it might have been an interesting read further down.

--shmem

_($_=" "x(1<<5)."?\n".q·/)Oo.  G°\        /
                              /\_¯/(q    /
----------------------------  \__(m.====·.(_("always off the crowd"))."·
");sub _{s./.($e="'Itrs `mnsgdq Gdbj O`qkdq")=~y/"-y/#-z/;$e.e && print}

[reply]
[d/l]

Re^3: Parrot, threads & fears for the future.

by merlyn (Sage) on Oct 23, 2006 at 13:25 UTC

If the answer is "threads", you asked the wrong question. Threads suck. Event-driven programs and processes are a much cleaner model, with a far better defect ratio.

And a forked program cluster will use your two, four, and 20-way core boxes just fine. No need to introduce the bizarre complexity of threads.

Using threads are like using globals... sure you can do it, but it requires a lot more care. Using fork is like having everything be local to its code and data, which is what you want in a large system anyway.

-- Randal L. Schwartz, Perl hacker
Be sure to read my standard disclaimer if this is a reply.

[reply]

Re^4: Parrot, threads & fears for the future.

by shmem (Chancellor) on Oct 23, 2006 at 14:33 UTC

You may have been a bit longer in this business than I (who came to computers off studying architecture around 1989) but I haven't been asleep in the meantime. I also remember the times when a good word processor on the Atari ST (RIP) was worth ~250kBytes on disk; I had my first perl experience with v4.36, and was glad and proud with my simple coding and delighted with Gurusamy's port to the Atari.

That's for that, Father ;-)

But as for threads - set apart the usefulnes of threads, your statement "threads in unix are spelled fork" is blatantly wrong. Threads were invented for a reason which shurely didn't escape you. Which reason? There.

Do threads solve the issues they address, and at which price? Here we might start a discussion, not just saying "threads suck" ("No, they don't suck." - "Yes, they do!" -"NO!") and throwing in false statements in an inflamatory way.

While I also never had the need to use threads, did all I had to do with fork or an event driven model (Event back then, then POE) - so I'm included in that "we were accomplishing" -, I also have to say that "Using threads are like using globals" is misleading, that would be more appropriate "using threads is like using our()".

I agree with you saying "Using fork is like having everything be local to its code and data", which is fine as long as you just want it that way and never ever are in need of shared variables. If you need shared data, you have the option of delegating the data to an external instance or to share it somehow via classical IPC mechanisms, shared memory segments, message queues, pipes and such - but you basically need the same synchronisation mechanisms as with threads (in come semaphores), and sharing complex data structures is IMHO some hells more difficult using IPC.

--shmem

_($_=" "x(1<<5)."?\n".q·/)Oo.  G°\        /
                              /\_¯/(q    /
----------------------------  \__(m.====·.(_("always off the crowd"))."·
");sub _{s./.($e="'Itrs `mnsgdq Gdbj O`qkdq")=~y/"-y/#-z/;$e.e && print}

[reply]
[d/l]
[select]

Re^5: Parrot, threads & fears for the future.

by merlyn (Sage) on Oct 23, 2006 at 17:32 UTC

Re^6: Parrot, threads & fears for the future.

by shmem (Chancellor) on Oct 23, 2006 at 23:54 UTC

Re^2: Parrot, threads & fears for the future.
by BrowserUk (Patriarch) on Oct 23, 2006 at 12:17 UTC

I stopped reading here: ... In unix, thread is spelled "f-o-r-k".

Had you read on, you might have learnt something.

twinunix - The world is not unix!

Fork is not threading.

Let's see you utilise multiple processes to evaluate

@array1 >> += << @array2;
[download]

On multiple processors concurrently.

Is that sand in your hair?

Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.

Lingua non convalesco, consenesco et abolesco. -- Rule 1 has a caveat! -- Who broke the cabal?

"Science is about questioning the status quo. Questioning authority".

In the absence of evidence, opinion is indistinguishable from prejudice.

[reply]
[d/l]

Re^3: Parrot, threads & fears for the future.

by Anonymous Monk on Oct 23, 2006 at 12:48 UTC

Now, if the process is only halfway its current time slice, and has to give up its timeslice because it needs to wait for the other three thread to join, now, that would be a shame.

I'm not saying that build in support for threading is a bad idea. But I do think it's a bad idea to do things in parallel without the programmers explicit permission. And it ain't going to be easy, as you need to cooperate with the OS, and Perl has to run on many OSses.

[reply]

Re^4: Parrot, threads & fears for the future.

by Corion (Patriarch) on Oct 23, 2006 at 12:51 UTC

I look at thread management much like I now look at memory management. Of course, Perl sacrifices a lot of performance to the reference counting, but it buys a lot of flexibility, convenience and security by using the automatic memory management instead of leaving the programmer to handle the memory, even if that could be more efficient.

[reply]

Re^5: Parrot, threads & fears for the future. (ref counting)

by tye (Sage) on Oct 23, 2006 at 15:04 UTC

Re^6: Parrot, threads & fears for the future. (ref counting)

by chromatic (Archbishop) on Oct 23, 2006 at 15:51 UTC

Some notes below your chosen depth have not been shown here

Re^6: Parrot, threads & fears for the future. (ref counting)

by adrianh (Chancellor) on Oct 23, 2006 at 17:20 UTC

Some notes below your chosen depth have not been shown here

Re^4: Parrot, threads & fears for the future.

by BrowserUk (Patriarch) on Oct 23, 2006 at 20:22 UTC

Let me speculate a little.

Imagine a VM-level 'thread' was not a whole interpeter running on a single kernel thread, but instead was a set of VM registers, a VM 'stack' pointer and a VM program counter. And capable of running on any interpreter in the process.

Now imagine that the VM queried the OS at startup for the number of hyperthreads/cores/CPU (N), and started 1 kernel thread running a separate interpreter in each. And these interpreters all 'listened' to a single, central queue of (pointers to) threads.

When a threadable opcode is reached, if the runtime size of the data to be manipulated is large enough to warrent it, the opcode splits up the dataset into N chunks and queues N copies of itself plus the partition dataset onto the thread queue ('scatter'); prefixed by a 'gather' opcode. Each available processor pulls a copy off, executes on it's partition of the dataset and stores the results.

The first interpreter finished, will pull the 'gather' opcode. It decrements an embedded count and waits upon an embedded semaphore. As each of the interpreters finishes and goes back to get the next piece of work, it does the same until the last finishing interpreter decrements th count to zero, freeing the semaphore and clearing the 'gather' opcode. That frees up all the interpreters to pull the next opcode off the queue and continue. Each waiting interpreter is in a wait state allow the other interpreters to continue, consuming no timeslices, preemptively multitasked by the OS.

The array has been processed, in parallel, by as many threads are available. No user space locking required. No threads needed to be started, nor torn down. Additional opcodes processed by the threaded code pushes are pushed onto the queue (stackwise).

The same VM thread (register set) forms the basis of user space threading for cooperative multi-tasking like in Java, Ruby and many more. The same thread structure is used for both forms of threading.

Just a thought (or five:).

Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.

Lingua non convalesco, consenesco et abolesco. -- Rule 1 has a caveat! -- Who broke the cabal?

"Science is about questioning the status quo. Questioning authority".

In the absence of evidence, opinion is indistinguishable from prejudice.

[reply]

Re^5: Parrot, threads & fears for the future.

by liz (Monsignor) on Oct 23, 2006 at 23:14 UTC

Re^6: Parrot, threads & fears for the future.

by BrowserUk (Patriarch) on Oct 24, 2006 at 03:09 UTC


Think about Loose Coupling
	PerlMonks