XP is just a number | |
PerlMonks |
perlman:perlcallby gods (Initiate) |
on Aug 25, 1999 at 07:03 UTC ( [id://415]=perlman: print w/replies, xml ) | Need Help?? |
perlcallCurrent Perl documentation can be found at perldoc.perl.org. Here is our local, out-dated (pre-5.6) version:
NAMEperlcall - Perl calling conventions from C
DESCRIPTIONThe purpose of this document is to show you how to call Perl subroutines directly from C, i.e., how to write callbacks. Apart from discussing the C interface provided by Perl for writing callbacks the document uses a series of examples to show how the interface actually works in practice. In addition some techniques for coding callbacks are covered. Examples where callbacks are necessary include
Although the techniques described here are applicable when embedding Perl in a C program, this is not the primary goal of this document. There are other details that must be considered and are specific to embedding Perl. For details on embedding Perl in C refer to the perlembed manpage. Before you launch yourself head first into the rest of this document, it would be a good idea to have read the following two documents - the perlxs manpage and the perlguts manpage.
THE PERL_CALL FUNCTIONSAlthough this stuff is easier to explain using examples, you first need be aware of a few important definitions. Perl has a number of C functions that allow you to call Perl subroutines. They are
I32 perl_call_sv(SV* sv, I32 flags) ; I32 perl_call_pv(char *subname, I32 flags) ; I32 perl_call_method(char *methname, I32 flags) ; I32 perl_call_argv(char *subname, I32 flags, register char **argv) ; The key function is perl_call_sv. All the other functions are fairly simple wrappers which make it easier to call Perl subroutines in special cases. At the end of the day they will all call perl_call_sv to invoke the Perl subroutine.
All the perl_call_* functions have a Each of the functions will now be discussed in turn.
All the functions return an integer. This is a count of the number of items returned by the Perl subroutine. The actual items returned by the subroutine are stored on the Perl stack. As a general rule you should always check the return value from these functions. Even if you are expecting only a particular number of values to be returned from the Perl subroutine, there is nothing to stop someone from doing something unexpected - don't say you haven't been warned.
FLAG VALUES
The
G_VOIDCalls the Perl subroutine in a void context. This flag has 2 effects:
The value returned by the perl_call_* function indicates how many items have been returned by the Perl subroutine - in this case it will be 0.
G_SCALARCalls the Perl subroutine in a scalar context. This is the default context flag setting for all the perl_call_* functions. This flag has 2 effects:
The value returned by the perl_call_* function indicates how many items have been returned by the Perl subroutine - in this case it will be either 0 or 1. If 0, then you have specified the G_DISCARD flag. If 1, then the item actually returned by the Perl subroutine will be stored on the Perl stack - the section Returning a Scalar shows how to access this value on the stack. Remember that regardless of how many items the Perl subroutine returns, only the last one will be accessible from the stack - think of the case where only one value is returned as being a list with only one element. Any other items that were returned will not exist by the time control returns from the perl_call_* function. The section Returning a list in a scalar context shows an example of this behavior.
G_ARRAYCalls the Perl subroutine in a list context. As with G_SCALAR, this flag has 2 effects:
The value returned by the perl_call_* function indicates how many items have been returned by the Perl subroutine. If 0, then you have specified the G_DISCARD flag. If not 0, then it will be a count of the number of items returned by the subroutine. These items will be stored on the Perl stack. The section Returning a list of values gives an example of using the G_ARRAY flag and the mechanics of accessing the returned items from the Perl stack.
G_DISCARDBy default, the perl_call_* functions place the items returned from by the Perl subroutine on the stack. If you are not interested in these items, then setting this flag will make Perl get rid of them automatically for you. Note that it is still possible to indicate a context to the Perl subroutine by using either G_SCALAR or G_ARRAY. If you do not set this flag then it is very important that you make sure that any temporaries (i.e., parameters passed to the Perl subroutine and values returned from the subroutine) are disposed of yourself. The section Returning a Scalar gives details of how to dispose of these temporaries explicitly and the section Using Perl to dispose of temporaries discusses the specific circumstances where you can ignore the problem and let Perl deal with it for you.
G_NOARGS
Whenever a Perl subroutine is called using one of the perl_call_*
functions, it is assumed by default that parameters are to be passed to the
subroutine. If you are not passing any parameters to the Perl subroutine,
you can save a bit of time by setting this flag. It has the effect of not
creating the Although the functionality provided by this flag may seem straightforward, it should be used only if there is a good reason to do so. The reason for being cautious is that even if you have specified the G_NOARGS flag, it is still possible for the Perl subroutine that has been called to think that you have passed it parameters.
In fact, what can happen is that the Perl subroutine you have called can
access the
sub fred { print "@_\n" }
sub joe { &fred }
&joe(1,2,3) ; This will print
1 2 3
What has happened is that
G_EVALIt is possible for the Perl subroutine you are calling to terminate abnormally, e.g., by calling die explicitly or by not actually existing. By default, when either of these events occurs, the process will terminate immediately. If you want to trap this type of event, specify the G_EVAL flag. It will put an eval { } around the subroutine call.
Whenever control returns from the perl_call_* function you need to check the The value returned from the perl_call_* function is dependent on what other flags have been specified and whether an error has occurred. Here are all the different cases that can occur:
See Using G_EVAL for details on using G_EVAL.
G_KEEPERR
You may have noticed that using the
G_EVAL flag described above will
always clear the
This scenario will mostly be applicable to code that is meant to be called
from within destructors, asynchronous callbacks, signal handlers, The G_KEEPERR flag is meant to be used in conjunction with G_EVAL in perl_call_* functions that are used to implement such code. This flag has no effect when G_EVAL is not used.
When
G_KEEPERR is used, any errors in the called code will
be prefixed with the string ``\t(in cleanup)'', and appended to the current
value of The G_KEEPERR flag was introduced in Perl version 5.002. See Using G_KEEPERR for an example of a situation that warrants the use of this flag.
Determining the ContextAs mentioned above, you can determine the context of the currently executing subroutine in Perl with wantarray. The equivalent test can be made in C by using the perlman:perlguts macro, which returns perlman:perlguts if you have been called in an array context, perlman:perlguts if in a scalar context, or perlman:perlguts if in a void context (i.e. the return value will not be used). An older version of this macro is called perlman:perlguts; in a void context it returns perlman:perlguts instead of perlman:perlguts. An example of using the perlman:perlguts macro is shown in section Using GIMME_V.
KNOWN PROBLEMSThis section outlines all known problems that exist in the perl_call_* functions.
EXAMPLESEnough of the definition talk, let's have a few examples. Perl provides many macros to assist in accessing the Perl stack. Wherever possible, these macros should always be used when interfacing to Perl internals. We hope this should make the code less vulnerable to any changes made to Perl in the future. Another point worth noting is that in the first series of examples I have made use of only the perl_call_pv function. This has been done to keep the code simpler and ease you into the topic. Wherever possible, if the choice is between using perl_call_pv and perl_call_sv, you should always try to use perl_call_sv. See Using perl_call_sv for details.
No Parameters, Nothing returnedThis first trivial example will call a Perl subroutine, PrintUID, to print out the UID of the process.
sub PrintUID { print "UID is $<\n" ; } and here is a C function to call it
static void call_PrintUID() { dSP ;
PUSHMARK(SP) ; perl_call_pv("PrintUID", G_DISCARD|G_NOARGS) ; } Simple, eh. A few points to note about this example.
Passing Parameters
Now let's make a slightly more complex example. This time we want to call a
Perl subroutine, So the Perl subroutine would look like this
sub LeftString { my($s, $n) = @_ ; print substr($s, 0, $n), "\n" ; } The C function required to call LeftString would look like this.
static void call_LeftString(a, b) char * a ; int b ; { dSP ;
ENTER ; SAVETMPS ;
PUSHMARK(SP) ; XPUSHs(sv_2mortal(newSVpv(a, 0))); XPUSHs(sv_2mortal(newSViv(b))); PUTBACK ;
perl_call_pv("LeftString", G_DISCARD);
FREETMPS ; LEAVE ; } Here are a few notes on the C function call_LeftString.
Returning a ScalarNow for an example of dealing with the items returned from a Perl subroutine. Here is a Perl subroutine, Adder, that takes 2 integer parameters and simply returns their sum.
sub Adder { my($a, $b) = @_ ; $a + $b ; } Because we are now concerned with the return value from Adder, the C function required to call it is now a bit more complex.
static void call_Adder(a, b) int a ; int b ; { dSP ; int count ;
ENTER ; SAVETMPS;
PUSHMARK(SP) ; XPUSHs(sv_2mortal(newSViv(a))); XPUSHs(sv_2mortal(newSViv(b))); PUTBACK ;
count = perl_call_pv("Adder", G_SCALAR);
SPAGAIN ;
if (count != 1) croak("Big trouble\n") ;
printf ("The sum of %d and %d is %d\n", a, b, POPi) ;
PUTBACK ; FREETMPS ; LEAVE ; } Points to note this time are
Returning a list of valuesNow, let's extend the previous example to return both the sum of the parameters and the difference. Here is the Perl subroutine
sub AddSubtract { my($a, $b) = @_ ; ($a+$b, $a-$b) ; } and this is the C function
static void call_AddSubtract(a, b) int a ; int b ; { dSP ; int count ;
ENTER ; SAVETMPS;
PUSHMARK(SP) ; XPUSHs(sv_2mortal(newSViv(a))); XPUSHs(sv_2mortal(newSViv(b))); PUTBACK ;
count = perl_call_pv("AddSubtract", G_ARRAY);
SPAGAIN ;
if (count != 2) croak("Big trouble\n") ;
printf ("%d - %d = %d\n", a, b, POPi) ; printf ("%d + %d = %d\n", a, b, POPi) ;
PUTBACK ; FREETMPS ; LEAVE ; } If call_AddSubtract is called like this
call_AddSubtract(7, 4) ; then here is the output
7 - 4 = 3 7 + 4 = 11 Notes
Returning a list in a scalar contextSay the Perl subroutine in the previous section was called in a scalar context, like this
static void call_AddSubScalar(a, b) int a ; int b ; { dSP ; int count ; int i ;
ENTER ; SAVETMPS;
PUSHMARK(SP) ; XPUSHs(sv_2mortal(newSViv(a))); XPUSHs(sv_2mortal(newSViv(b))); PUTBACK ;
count = perl_call_pv("AddSubtract", G_SCALAR);
SPAGAIN ;
printf ("Items Returned = %d\n", count) ;
for (i = 1 ; i <= count ; ++i) printf ("Value %d = %d\n", i, POPi) ;
PUTBACK ; FREETMPS ; LEAVE ; } The other modification made is that call_AddSubScalar will print the number of items returned from the Perl subroutine and their value (for simplicity it assumes that they are integer). So if call_AddSubScalar is called
call_AddSubScalar(7, 4) ; then the output will be
Items Returned = 1 Value 1 = 3 In this case the main point to note is that only the last item in the list is returned from the subroutine, AddSubtract actually made it back to call_AddSubScalar.
Returning Data from Perl via the parameter listIt is also possible to return values directly via the parameter list - whether it is actually desirable to do it is another matter entirely. The Perl subroutine, Inc, below takes 2 parameters and increments each directly.
sub Inc { ++ $_[0] ; ++ $_[1] ; } and here is a C function to call it.
static void call_Inc(a, b) int a ; int b ; { dSP ; int count ; SV * sva ; SV * svb ;
ENTER ; SAVETMPS;
sva = sv_2mortal(newSViv(a)) ; svb = sv_2mortal(newSViv(b)) ;
PUSHMARK(SP) ; XPUSHs(sva); XPUSHs(svb); PUTBACK ;
count = perl_call_pv("Inc", G_DISCARD);
if (count != 0) croak ("call_Inc: expected 0 values from 'Inc', got %d\n", count) ;
printf ("%d + 1 = %d\n", a, SvIV(sva)) ; printf ("%d + 1 = %d\n", b, SvIV(svb)) ;
FREETMPS ; LEAVE ; }
To be able to access the two parameters that were pushed onto the stack
after they return from perl_call_pv it is necessary to make a note of their addresses - thus the two variables The reason this is necessary is that the area of the Perl stack which held them will very likely have been overwritten by something else by the time control returns from perl_call_pv.
Using G_EVALNow an example using G_EVAL. Below is a Perl subroutine which computes the difference of its 2 parameters. If this would result in a negative result, the subroutine calls die.
sub Subtract { my ($a, $b) = @_ ;
die "death can be fatal\n" if $a < $b ;
$a - $b ; } and some C to call it
static void call_Subtract(a, b) int a ; int b ; { dSP ; int count ;
ENTER ; SAVETMPS;
PUSHMARK(SP) ; XPUSHs(sv_2mortal(newSViv(a))); XPUSHs(sv_2mortal(newSViv(b))); PUTBACK ;
count = perl_call_pv("Subtract", G_EVAL|G_SCALAR);
SPAGAIN ;
/* Check the eval first */ if (SvTRUE(ERRSV)) { printf ("Uh oh - %s\n", SvPV(ERRSV, PL_na)) ; POPs ; } else { if (count != 1) croak("call_Subtract: wanted 1 value from 'Subtract', got %d\n", count) ;
printf ("%d - %d = %d\n", a, b, POPi) ; }
PUTBACK ; FREETMPS ; LEAVE ; } If call_Subtract is called thus
call_Subtract(4, 5) the following will be printed
Uh oh - death can be fatal Notes
Using G_KEEPERRConsider this rather facetious example, where we have used an XS version of the call_Subtract example above inside a destructor:
package Foo; sub new { bless {}, $_[0] } sub Subtract { my($a,$b) = @_; die "death can be fatal" if $a < $b ; $a - $b; } sub DESTROY { call_Subtract(5, 4); } sub foo { die "foo dies"; }
package main; eval { Foo->new->foo }; print "Saw: $@" if $@; # should be, but isn't
This example will fail to recognize that an error occurred inside the
perlfunc:eval. Here's why: the call_Subtract code got executed while perl was cleaning
up temporaries when exiting the eval block, and because call_Subtract is
implemented with perl_call_pv using the
G_EVAL flag, it promptly reset Appending the G_KEEPERR flag, so that the perl_call_pv call in call_Subtract reads:
count = perl_call_pv("Subtract", G_EVAL|G_SCALAR|G_KEEPERR); will preserve the error and restore reliable error handling.
Using perl_call_svIn all the previous examples I have 'hard-wired' the name of the Perl subroutine to be called from C. Most of the time though, it is more convenient to be able to specify the name of the Perl subroutine from within the Perl script. Consider the Perl code below
sub fred { print "Hello there\n" ; }
CallSubPV("fred") ; Here is a snippet of XSUB which defines CallSubPV.
void CallSubPV(name) char * name CODE: PUSHMARK(SP) ; perl_call_pv(name, G_DISCARD|G_NOARGS) ; That is fine as far as it goes. The thing is, the Perl subroutine can be specified as only a string. For Perl 4 this was adequate, but Perl 5 allows references to subroutines and anonymous subroutines. This is where perl_call_sv is useful.
The code below for CallSubSV is identical to CallSubPV except that the
void CallSubSV(name) SV * name CODE: PUSHMARK(SP) ; perl_call_sv(name, G_DISCARD|G_NOARGS) ; Because we are using an SV to call fred the following can all be used
CallSubSV("fred") ; CallSubSV(\&fred) ; $ref = \&fred ; CallSubSV($ref) ; CallSubSV( sub { print "Hello there\n" } ) ; As you can see, perl_call_sv gives you much greater flexibility in how you can specify the Perl subroutine.
You should note that if it is necessary to store the
SV (
static SV * rememberSub ;
void SaveSub1(name) SV * name CODE: rememberSub = name ;
void CallSavedSub1() CODE: PUSHMARK(SP) ; perl_call_sv(rememberSub, G_DISCARD|G_NOARGS) ;
The reason this is wrong is that by the time you come to use the pointer
SaveSub1(\&fred) ; CallSavedSub1() ;
SaveSub1( sub { print "Hello there\n" } ) ; CallSavedSub1() ;
By the time each of the
Can't use an undefined value as a subroutine reference at ...
for each of the Similarly, with this code
$ref = \&fred ; SaveSub1($ref) ; $ref = 47 ; CallSavedSub1() ; you can expect one of these messages (which you actually get is dependent on the version of Perl you are using)
Not a CODE reference at ... Undefined subroutine &main::47 called ...
The variable perlfunc:ref may have referred to the subroutine A similar but more subtle problem is illustrated with this code
$ref = \&fred ; SaveSub1($ref) ; $ref = \&joe ; CallSavedSub1() ;
This time whenever
To get around these problems it is necessary to take a full copy of the
SV. The code below shows
static SV * keepSub = (SV*)NULL ;
void SaveSub2(name) SV * name CODE: /* Take a copy of the callback */ if (keepSub == (SV*)NULL) /* First time, so create a new SV */ keepSub = newSVsv(name) ; else /* Been here before, so overwrite */ SvSetSV(keepSub, name) ;
void CallSavedSub2() CODE: PUSHMARK(SP) ; perl_call_sv(keepSub, G_DISCARD|G_NOARGS) ;
To avoid creating a new
SV every time
Using perl_call_argvHere is a Perl subroutine which prints whatever parameters are passed to it.
sub PrintList { my(@list) = @_ ;
foreach (@list) { print "$_\n" } } and here is an example of perl_call_argv which will call PrintList.
static char * words[] = {"alpha", "beta", "gamma", "delta", NULL} ;
static void call_PrintList() { dSP ;
perl_call_argv("PrintList", G_DISCARD, words) ; } Note that it is not necessary to call perlman:perlguts in this instance. This is because perl_call_argv will do it for you.
Using perl_call_methodConsider the following Perl code
{ package Mine ;
sub new { my($type) = shift ; bless [@_] }
sub Display { my ($self, $index) = @_ ; print "$index: $$self[$index]\n" ; }
sub PrintID { my($class) = @_ ; print "This is Class $class version 1.0\n" ; } }
It implements just a very simple class to manage an array. Apart from the
constructor,
$a = new Mine ('red', 'green', 'blue') ; $a->Display(1) ; PrintID Mine; will print
1: green This is Class Mine version 1.0 Calling a Perl method from C is fairly straightforward. The following things are required
Here is a simple
XSUB which illustrates the mechanics of calling both
the
void call_Method(ref, method, index) SV * ref char * method int index CODE: PUSHMARK(SP); XPUSHs(ref); XPUSHs(sv_2mortal(newSViv(index))) ; PUTBACK;
perl_call_method(method, G_DISCARD) ;
void call_PrintID(class, method) char * class char * method CODE: PUSHMARK(SP); XPUSHs(sv_2mortal(newSVpv(class, 0))) ; PUTBACK;
perl_call_method(method, G_DISCARD) ;
So the methods
$a = new Mine ('red', 'green', 'blue') ; call_Method($a, 'Display', 1) ; call_PrintID('Mine', 'PrintID') ; The only thing to note is that in both the static and virtual methods, the method name is not passed via the stack - it is used as the first parameter to perl_call_method.
Using GIMME_VHere is a trivial XSUB which prints the context in which it is currently executing.
void PrintContext() CODE: I32 gimme = GIMME_V; if (gimme == G_VOID) printf ("Context is Void\n") ; else if (gimme == G_SCALAR) printf ("Context is Scalar\n") ; else printf ("Context is Array\n") ; and here is some Perl to test it
PrintContext ; $a = PrintContext ; @a = PrintContext ; The output from that will be
Context is Void Context is Scalar Context is Array
Using Perl to dispose of temporariesIn the examples given to date, any temporaries created in the callback (i.e., parameters passed on the stack to the perl_call_* function or values returned via the stack) have been freed by one of these methods
There is another method which can be used, namely letting Perl do it for you automatically whenever it regains control after the callback has terminated. This is done by simply not using the
ENTER ; SAVETMPS ; ... FREETMPS ; LEAVE ; sequence in the callback (and not, of course, specifying the G_DISCARD flag). If you are going to use this method you have to be aware of a possible memory leak which can arise under very specific circumstances. To explain these circumstances you need to know a bit about the flow of control between Perl and the callback routine. The examples given at the start of the document (an error handler and an event driven program) are typical of the two main sorts of flow control that you are likely to encounter with callbacks. There is a very important distinction between them, so pay attention. In the first example, an error handler, the flow of control could be as follows. You have created an interface to an external library. Control can reach the external library like this
perl --> XSUB --> external library Whilst control is in the library, an error condition occurs. You have previously set up a Perl callback to handle this situation, so it will get executed. Once the callback has finished, control will drop back to Perl again. Here is what the flow of control will be like in that situation
perl --> XSUB --> external library ... error occurs ... external library --> perl_call --> perl | perl <-- XSUB <-- external library <-- perl_call <----+ After processing of the error using perl_call_* is completed, control reverts back to Perl more or less immediately. In the diagram, the further right you go the more deeply nested the scope is. It is only when control is back with perl on the extreme left of the diagram that you will have dropped back to the enclosing scope and any temporaries you have left hanging around will be freed. In the second example, an event driven program, the flow of control will be more like this
perl --> XSUB --> event handler ... event handler --> perl_call --> perl | event handler <-- perl_call <----+ ... event handler --> perl_call --> perl | event handler <-- perl_call <----+ ... event handler --> perl_call --> perl | event handler <-- perl_call <----+ In this case the flow of control can consist of only the repeated sequence
event handler --> perl_call --> perl for practically the complete duration of the program. This means that control may never drop back to the surrounding scope in Perl at the extreme left. So what is the big problem? Well, if you are expecting Perl to tidy up those temporaries for you, you might be in for a long wait. For Perl to dispose of your temporaries, control must drop back to the enclosing scope at some stage. In the event driven scenario that may never happen. This means that as time goes on, your program will create more and more temporaries, none of which will ever be freed. As each of these temporaries consumes some memory your program will eventually consume all the available memory in your system - kapow! So here is the bottom line - if you are sure that control will revert back to the enclosing Perl scope fairly quickly after the end of your callback, then it isn't absolutely necessary to dispose explicitly of any temporaries you may have created. Mind you, if you are at all uncertain about what to do, it doesn't do any harm to tidy up anyway. |
|