Parent Directory
|
Revision Log
Changed useless "warn" to useful "cluck" in &Trace(). -- /gdp
# -*- perl -*- ######################################################################## # Copyright (c) 2003-2006 University of Chicago and Fellowship # for Interpretations of Genomes. All Rights Reserved. # # This file is part of the SEED Toolkit. # # The SEED Toolkit is free software. You can redistribute # it and/or modify it under the terms of the SEED Toolkit # Public License. # # You should have received a copy of the SEED Toolkit Public License # along with this program; if not write to the University of Chicago # at info@ci.uchicago.edu or the Fellowship for Interpretation of # Genomes at veronika@thefig.info or download a copy from # http://www.theseed.org/LICENSE.TXT. ######################################################################## package Tracer; require Exporter; @ISA = ('Exporter'); @EXPORT = qw(Trace T TSetup QTrace Confess Cluck Min Max Assert Open OpenDir TICK StandardSetup EmergencyKey ETracing ScriptSetup ScriptFinish Insure ChDir Emergency); @EXPORT_OK = qw(GetFile GetOptions Merge MergeOptions ParseCommand ParseRecord UnEscape Escape); use strict; use Carp qw(longmess croak); use CGI; use Cwd; use FIG_Config; use PageBuilder; use Digest::MD5; use File::Basename; use File::Path; use File::stat; use LWP::UserAgent; use Time::HiRes 'gettimeofday'; use URI::Escape; use Time::Local; =head1 Tracing and Debugging Helpers =head2 Tracing This package provides simple tracing for debugging and reporting purposes. To use it simply call the L</TSetup> or L</ETracing> method to set the options and call L</Trace> to write out trace messages. L</TSetup> and L</ETracing> both establish a I<trace level> and a list of I<categories>. Similarly, each trace message has a I<trace level> and I<category> associated with it. Only messages whose trace level is less than or equal to the setup trace level and whose category is activated will be written. Thus, a higher trace level on a message indicates that the message is less likely to be seen, while a higher trace level passed to B<TSetup> means more trace messages will appear. =head3 Putting Trace Messages in Your Code To generate a trace message, use the following syntax. Trace($message) if T(errors => 4); This statement will produce a trace message if the trace level is 4 or more and the C<errors> category is active. There is a special category C<main> that is always active, so Trace($message) if T(main => 4); will trace if the trace level is 4 or more. If the category name is the same as the package name, all you need is the number. So, if the following call is made in the B<Sprout> package, it will appear if the C<Sprout> category is active and the trace level is 2 or more. Trace($message) if T(2); In scripts, where no package name is available, the category defaults to C<main>. =head3 Custom Tracing Many programs have customized tracing configured using the L</TSetup> method. This is no longer the preferred method, but a knowledge of how custom tracing works can make the more modern L</Emergency Tracing> easier to understand. To set up custom tracing, you call the L</TSetup> method. The method takes as input a trace level, a list of category names, and a destination. The trace level and list of category names are specified as a space-delimited string. Thus TSetup('3 errors Sprout ERDB', 'TEXT'); sets the trace level to 3, activates the C<errors>, C<Sprout>, and C<ERDB> categories, and specifies that messages should be sent to the standard output. To turn on tracing for ALL categories, use an asterisk. The call below sets every category to level 3 and writes the output to the standard error output. This sort of thing might be useful in a CGI environment. TSetup('3 *', 'WARN'); In addition standard error and file output for trace messages, you can specify that the trace messages be queued. The messages can then be retrieved by calling the L</QTrace> method. This approach is useful if you are building a web page. Instead of having the trace messages interspersed with the page output, they can be gathered together and displayed at the end of the page. This makes it easier to debug page formatting problems. Finally, you can specify that all trace messages be emitted to a file, or the standard output and a file at the same time. To trace to a file, specify the filename with an output character in front of it. TSetup('4 SQL', ">$fileName"); To trace to the standard output and a file at the same time, put a C<+> in front of the angle bracket. TSetup('3 *', "+>$fileName"); The flexibility of tracing makes it superior to simple use of directives like C<die> and C<warn>. Tracer calls can be left in the code with minimal overhead and then turned on only when needed. Thus, debugging information is available and easily retrieved even when the application is being used out in the field. =head3 Trace Levels There is no hard and fast rule on how to use trace levels. The following is therefore only a suggestion. =over 4 =item Error 0 Message indicates an error that may lead to incorrect results or that has stopped the application entirely. =item Warning 1 Message indicates something that is unexpected but that probably did not interfere with program execution. =item Notice 2 Message indicates the beginning or end of a major task. =item Information 3 Message indicates a subtask. In the FIG system, a subtask generally relates to a single genome. This would be a big loop that is not expected to execute more than 500 times or so. =item Detail 4 Message indicates a low-level loop iteration. =back The format of trace messages is important because some utilities analyze trace files. There are three fields-- the time stamp, the category name, and the text. The time stamp is between square brackets and the category name between angle brackets. After the category name there is a colon (C<:>) followed by the message text. If the square brackets or angle brackets are missing, then the trace management utilities assume that they are encountering a set of pre-formatted lines. Note, however, that this formatting is done automatically by the tracing functions. You only need to know about it if you want to parse a trace file. =head3 Emergency Tracing Sometimes, you need a way for tracing to happen automatically without putting parameters in a form or on the command line. Emergency tracing does this. You invoke emergency tracing from the debug form, which is accessed from I<MySeedInstance>C</FIG/Html/SetPassword.html>. Emergency tracing requires you specify a tracing key. For command-line tools, the key is taken from the C<TRACING> environment variable. For web services, the key is taken from a cookie. Either way, the key tells the tracing facility who you are, so that you control the tracing in your environment without stepping on other users. The key can be anything you want. If you don't have a key, the C<SetPassword> page will generate one for you. You can activate and de-activate emergency tracing from the debugging control panel, as well as display the trace file itself. To enable emergency tracing in your code, call ETracing($cgi) from a web script and ETracing() from a command-line script. The web script will look for the tracing key in the cookies, and the command-line script will look for it in the C<TRACING> environment variable. If you are using the L</StandardScript> or L</StandardSetup> methods, emergency tracing will be configured automatically. NOTE: to configure emergency tracing from the command line instead of the Debugging Control Panel (see below), use the C<trace.pl> script. =head3 Debugging Control Panel The debugging control panel provides several tools to assist in development of SEED and Sprout software. You access the debugging control panel from the URL C</FIG/Html/SetPassword.html> in whichever seed instance you're using. (So, for example, the panel access point for the development NMPDR system is C<http://web-1.nmpdr.org/next/FIG/Html/SetPassword.html>. Contact Bruce to find out what the password is. From this page, you can also specify a tracing key. If you don't specify a key, one will be generated for you. =head4 Emergency Tracing Form At the bottom of the debugging control panel is a form that allows you to specify a trace level and tracing categories. Special and common categories are listed with check boxes. You can hold your mouse over a check box to see what its category does. In general, however, a category name is the same as the name of the package in which the trace message occurs. Additional categories can be entered in an input box, delimited by spaces or commas. The B<Activate> button turns on Emergency tracing at the level you specify with the specified categories active. The B<Terminate> button turns tracing off. The B<Show File> button displays the current contents of the trace file. The tracing form at the bottom of the control panel is designed for emergency tracing, so it will only affect programs that call L</ETracing>, L</StandardScript>, or L</StandardSetup>. =head4 Script Form The top form of the debugging control panel allows you to enter a tiny script and have the output generated in a formatted table. Certain object variables are predefined in the script, including a FIG object (C<$fig>), a CGI object (C<$cgi>), and-- if Sprout is active-- Sprout (C<$sprout>) and SFXlate (C<$sfx>) objects. The last line of the script must be a scalar, but it can be a reference to a hash, a list, a list of lists, and various other combinations. If you select the appropriate data type in the dropdown box, the output will be formatted accordingly. The form also has controls for specifying tracing. These controls override any emergency tracing in effect. =head4 Database Query Forms The forms between the script form and the emergency tracing form allow you to make queries against the database. The FIG query form allows simple queries against a single FIG table. The Sprout query form uses the B<GetAll> method to do a multi-table query against the Sprout database. B<GetAll> is located in the B<ERDB> package, and it takes five parameters. GetAll(\@objectNames, $filterClause, \@parameters, \@fields, $count); Each of the five parameters corresponds to a text box on the query form: =over 4 =item Objects Comma-separated list containing the names of the entity and relationship objects to be retrieved. =item Filter WHERE/ORDER BY clause (without the WHERE) to be used to filter and sort the query. The WHERE clause can be parameterized with parameter markers (C<?>). Each field used must be specified in the standard form B<I<objectName>(I<fieldName>)> or B<$I<number>(I<fieldName>)> where I<fieldName> is the name of a field, I<objectName> is the name of the entity or relationship object containing the field, and I<number> is the 1-based position of the object in the object list. Any parameters specified in the filter clause should be specified in the B<Params> field. The fields in a filter clause can come from primary entity relations, relationship relations, or secondary entity relations; however, all of the entities and relationships involved must be included in the list of object names. =item Params List of the parameters to be substituted in for the parameters marks in the filter clause. This is a comma-separated list without any quoting or escaping. =item fields Comma-separated list of the fields to be returned in each element of the list returned. Fields are specified in the same manner as in the filter clause. =item count Maximum number of records to return. If omitted or 0, all available records will be returned. =back B<GetAll> automatically joins together the entities and relationships listed in the object names. This simplifies the coding of the filter clause, but it means that some queries are not possible, since they cannot be expressed in a linear sequence of joins. This is a limitation that has yet to be addressed. =cut # Declare the configuration variables. my $Destination = "WARN"; # Description of where to send the trace output. my $TeeFlag = 0; # TRUE if output is going to a file and to the # standard output my %Categories = ( main => 1 ); # hash of active category names my $TraceLevel = 0; # trace level; a higher trace level produces more # messages my @Queue = (); # queued list of trace messages. my $LastCategory = "main"; # name of the last category interrogated my $SetupCount = 0; # number of times TSetup called my $AllTrace = 0; # TRUE if we are tracing all categories. =head2 Tracing Methods =head3 Setups my $count = Tracer::Setups(); Return the number of times L</TSetup> has been called. This method allows for the creation of conditional tracing setups where, for example, we may want to set up tracing if nobody else has done it before us. =cut sub Setups { return $SetupCount; } =head3 TSetup TSetup($categoryList, $target); This method is used to specify the trace options. The options are stored as package data and interrogated by the L</Trace> and L</T> methods. =over 4 =item categoryList A string specifying the trace level and the categories to be traced, separated by spaces. The trace level must come first. =item target The destination for the trace output. To send the trace output to a file, specify the file name preceded by a ">" symbol. If a double symbol is used (">>"), then the data is appended to the file. Otherwise the file is cleared before tracing begins. Precede the first ">" symbol with a C<+> to echo output to a file AND to the standard output. In addition to sending the trace messages to a file, you can specify a special destination. C<HTML> will cause tracing to the standard output with each line formatted as an HTML paragraph. C<TEXT> will cause tracing to the standard output as ordinary text. C<ERROR> will cause trace messages to be sent to the standard error output as ordinary text. C<QUEUE> will cause trace messages to be stored in a queue for later retrieval by the L</QTrace> method. C<WARN> will cause trace messages to be emitted as warnings using the B<warn> directive. C<NONE> will cause tracing to be suppressed. =back =cut sub TSetup { # Get the parameters. my ($categoryList, $target) = @_; # Parse the category list. my @categoryData = split /\s+/, $categoryList; # Extract the trace level. $TraceLevel = shift @categoryData; # Presume category-based tracing until we learn otherwise. $AllTrace = 0; # Build the category hash. Note that if we find a "*", we turn on non-category # tracing. We must also clear away any pre-existing data. %Categories = ( main => 1 ); for my $category (@categoryData) { if ($category eq '*') { $AllTrace = 1; } else { $Categories{lc $category} = 1; } } # Now we need to process the destination information. The most important special # cases are the single ">", which requires we clear the file first, and the # "+" prefix which indicates a double echo. if ($target =~ m/^\+?>>?/) { if ($target =~ m/^\+/) { $TeeFlag = 1; $target = substr($target, 1); } if ($target =~ m/^>[^>]/) { open TRACEFILE, $target; print TRACEFILE "[" . Now() . "] <Tracer>: Tracing initialized.\n"; close TRACEFILE; $Destination = ">$target"; } else { $Destination = $target; } } else { $Destination = uc($target); } # Increment the setup counter. $SetupCount++; } =head3 SetLevel Tracer::SetLevel($newLevel); Modify the trace level. A higher trace level will cause more messages to appear. =over 4 =item newLevel Proposed new trace level. =back =cut sub SetLevel { $TraceLevel = $_[0]; } =head3 ParseTraceDate my $time = Tracer::ParseTraceDate($dateString); Convert a date from the trace file into a PERL timestamp. =over 4 =item dateString The date string from the trace file. The format of the string is determined by the L</Now> method. =item RETURN Returns a PERL time, that is, a number of seconds since the epoch, or C<undef> if the time string is invalid. =back =cut sub ParseTraceDate { # Get the parameters. my ($dateString) = @_; # Declare the return variable. my $retVal; # Parse the date. if ($dateString =~ m#(\d+)/(\d+)/(\d+)\s+(\d+):(\d+):(\d+)#) { # Create a time object. Note we need to convert the day, month, # and year to a different base. Years count from 1900, and # the internal month value is relocated to January = 0. $retVal = timelocal($6, $5, $4, $2, $1 - 1, $3 - 1900); } # Return the result. return $retVal; } =head3 LogErrors Tracer::LogErrors($fileName); Route the standard error output to a log file. =over 4 =item fileName Name of the file to receive the error output. =back =cut sub LogErrors { # Get the file name. my ($fileName) = @_; # Open the file as the standard error output. open STDERR, '>', $fileName; } =head3 Trace Trace($message); Write a trace message to the target location specified in L</TSetup>. If there has not been any prior call to B<TSetup>. =over 4 =item message Message to write. =back =cut sub Trace { # Get the parameters. my ($message) = @_; # Get the timestamp. my $timeStamp = Now(); # Format the message. Note we strip off any line terminators at the end. my $prefix = "[$timeStamp] <$LastCategory>: "; my $formatted = $prefix . Strip($message); # Process according to the destination. if ($Destination eq "TEXT") { # Write the message to the standard output. print "$formatted\n"; } elsif ($Destination eq "ERROR") { # Write the message to the error output. print STDERR "$formatted\n"; } elsif ($Destination eq "QUEUE") { # Push the message into the queue. push @Queue, "$formatted"; } elsif ($Destination eq "HTML") { # Convert the message to HTML and write it to the standard output. my $escapedMessage = CGI::escapeHTML($message); print "<p>$timeStamp $LastCategory: $escapedMessage</p>\n"; } elsif ($Destination eq "WARN") { # Emit the message as a warning. carp $message; } elsif ($Destination =~ m/^>>/) { # Write the trace message to an output file. (open TRACING, $Destination) || die "Tracing open for \"$Destination\" failed: $!"; print TRACING "$formatted\n"; close TRACING; # If the Tee flag is on, echo it to the standard output. if ($TeeFlag) { print "$formatted\n"; } } } =head3 T my $switch = T($category, $traceLevel); or my $switch = T($traceLevel); Return TRUE if the trace level is at or above a specified value and the specified category is active, else FALSE. If no category is specified, the caller's package name is used. =over 4 =item category Category to which the message belongs. If not specified, the caller's package name is used. =item traceLevel Relevant tracing level. =item RETURN TRUE if a message at the specified trace level would appear in the trace, else FALSE. =back =cut sub T { # Declare the return variable. my $retVal = 0; # Only proceed if tracing is turned on. if ($Destination ne "NONE") { # Get the parameters. my ($category, $traceLevel) = @_; if (!defined $traceLevel) { # Here we have no category, so we need to get the calling package. # The calling package is normally the first parameter. If it is # omitted, the first parameter will be the tracelevel. So, the # first thing we do is shift the so-called category into the # $traceLevel variable where it belongs. $traceLevel = $category; my ($package, $fileName, $line) = caller; # If there is no calling package, we default to "main". if (!$package) { $category = "main"; } else { my @cats = split /::/, $package; $category = $cats[$#cats]; } } # Save the category name. $LastCategory = $category; # Convert it to lower case before we hash it. $category = lc $category; # Use the category and tracelevel to compute the result. if (ref $traceLevel) { Confess("Bad trace level."); } elsif (ref $TraceLevel) { Confess("Bad trace config."); } $retVal = ($traceLevel <= $TraceLevel && ($AllTrace || exists $Categories{$category})); } # Return the computed result. return $retVal; } =head3 QTrace my $data = QTrace($format); Return the queued trace data in the specified format. =over 4 =item format C<html> to format the data as an HTML list, C<text> to format it as straight text. =back =cut sub QTrace { # Get the parameter. my ($format) = @_; # Create the return variable. my $retVal = ""; # Only proceed if there is an actual queue. if (@Queue) { # Process according to the format. if ($format =~ m/^HTML$/i) { # Convert the queue into an HTML list. $retVal = "<ul>\n"; for my $line (@Queue) { my $escapedLine = CGI::escapeHTML($line); $retVal .= "<li>$escapedLine</li>\n"; } $retVal .= "</ul>\n"; } elsif ($format =~ m/^TEXT$/i) { # Convert the queue into a list of text lines. $retVal = join("\n", @Queue) . "\n"; } # Clear the queue. @Queue = (); } # Return the formatted list. return $retVal; } =head3 Confess Confess($message); Trace the call stack and abort the program with the specified message. When used with the OR operator and the L</Assert> method, B<Confess> can function as a debugging assert. So, for example Assert($recNum >= 0) || Confess("Invalid record number $recNum."); Will abort the program with a stack trace if the value of C<$recNum> is negative. =over 4 =item message Message to include in the trace. =back =cut sub Confess { # Get the parameters. my ($message) = @_; if (! defined($FIG_Config::no_tool_hdr)) { # Here we have a tool header. Display its length so that the user can adjust the line numbers. my $toolHeaderFile = "$FIG_Config::fig_disk/dist/releases/current/$FIG_Config::arch/tool_hdr"; # Only proceed if the tool header file is actually present. if (-f $toolHeaderFile) { my @lines = GetFile($toolHeaderFile); Trace("Tool header has " . scalar(@lines) . " lines."); } } # Trace the call stack. Cluck($message); # Abort the program. croak(">>> $message"); } =head3 Assert Assert($condition1, $condition2, ... $conditionN); Return TRUE if all the conditions are true. This method can be used in conjunction with the OR operator and the L</Confess> method as a debugging assert. So, for example Assert($recNum >= 0) || Confess("Invalid record number $recNum."); Will abort the program with a stack trace if the value of C<$recNum> is negative. =cut sub Assert { my $retVal = 1; LOOP: for my $condition (@_) { if (! $condition) { $retVal = 0; last LOOP; } } return $retVal; } =head3 Cluck Cluck($message); Trace the call stack. Note that for best results, you should qualify the call with a trace condition. For example, Cluck("Starting record parse.") if T(3); will only trace the stack if the trace level for the package is 3 or more. =over 4 =item message Message to include in the trace. =back =cut sub Cluck { # Get the parameters. my ($message) = @_; # Trace what's happening. Trace("Stack trace for event: $message"); my $confession = longmess($message); # Convert the confession to a series of trace messages. Note we skip any # messages relating to calls into Tracer. for my $line (split /\s*\n/, $confession) { Trace($line) if ($line !~ /Tracer\.pm/); } } =head3 ScriptSetup my ($cgi, $varHash) = ScriptSetup($noTrace); Perform standard tracing and debugging setup for scripts. The value returned is the CGI object followed by a pre-built variable hash. At the end of the script, the client should call L</ScriptFinish> to output the web page. This method calls L</ETracing> to configure tracing, which allows the tracing to be configured via the emergency tracing form on the debugging control panel. Tracing will then be turned on automatically for all programs that use the L</ETracing> method, which includes every program that uses this method or L</StandardSetup>. =over 4 =item noTrace (optional) If specified, tracing will be suppressed. This is useful if the script wants to set up tracing manually. =item RETURN Returns a two-element list consisting of a CGI query object and a variable hash for the output page. =back =cut sub ScriptSetup { # Get the parameters. my ($noTrace) = @_; # Get the CGI query object. my $cgi = CGI->new(); # Set up tracing if it's not suppressed. ETracing($cgi) unless $noTrace; # Create the variable hash. my $varHash = { results => '' }; # Return the query object and variable hash. return ($cgi, $varHash); } =head3 ETracing ETracing($parameter); Set up emergency tracing. Emergency tracing is tracing that is turned on automatically for any program that calls this method. The emergency tracing parameters are stored in a a file identified by a tracing key. If this method is called with a CGI object, then the tracing key is taken from a cookie. If it is called with no parameters, then the tracing key is taken from an environment variable. If it is called with a string, the tracing key is that string. =over 4 =item parameter A parameter from which the tracing key is computed. If it is a scalar, that scalar is used as the tracing key. If it is a CGI object, the tracing key is taken from the C<IP> cookie. If it is omitted, the tracing key is taken from the C<TRACING> environment variable. If it is a CGI object and emergency tracing is not on, the C<Trace> and C<TF> parameters will be used to determine the type of tracing. =back =cut sub ETracing { # Get the parameter. my ($parameter) = @_; # Check for CGI mode. my $cgi = (ref $parameter eq 'CGI' ? $parameter : undef); # Default to no tracing except errors. my ($tracing, $dest) = ("0", "WARN"); # Check for emergency tracing. my $tkey = EmergencyKey($parameter); my $emergencyFile = EmergencyFileName($tkey); if (-e $emergencyFile) { # We have the file. Read in the data. my @tracing = GetFile($emergencyFile); # Pull off the time limit. my $expire = shift @tracing; # Convert it to seconds. $expire *= 3600; # Check the file data. my $stat = stat($emergencyFile); my ($now) = gettimeofday; if ($now - $stat->mtime > $expire) { # Delete the expired file. unlink $emergencyFile; } else { # Emergency tracing is on. Pull off the destination and # the trace level; $dest = shift @tracing; my $level = shift @tracing; # Convert the destination to a real tracing destination. # temp directory. $dest = EmergencyTracingDest($tkey, $dest); # Insure Tracer is specified. my %moduleHash = map { $_ => 1 } @tracing; $moduleHash{Tracer} = 1; # Set the trace parameter. $tracing = join(" ", $level, sort keys %moduleHash); } } elsif (defined $cgi) { # There's no emergency tracing, but we have a CGI object, so check # for tracing from the form parameters. if ($cgi->param('Trace')) { # Here the user has requested tracing via a form. $dest = ($cgi->param('TF') ? ">$FIG_Config::temp/Trace$$.log" : "QUEUE"); $tracing = $cgi->param('Trace') . " Tracer"; } } # Setup the tracing we've determined from all the stuff above. TSetup($tracing, $dest); # Check to see if we're a web script. if (defined $cgi) { # Yes we are. Trace the form and environment data. TraceParms($cgi); # Check for RAW mode. In raw mode, we print a fake header so that we see everything # emitted by the script in its raw form. if (T(Raw => 3)) { print CGI::header(-type => 'text/plain', -tracing => 'Raw'); } } } =head3 EmergencyFileName my $fileName = Tracer::EmergencyFileName($tkey); Return the emergency tracing file name. This is the file that specifies the tracing information. =over 4 =item tkey Tracing key for the current program. =item RETURN Returns the name of the file to contain the emergency tracing information. =back =cut sub EmergencyFileName { # Get the parameters. my ($tkey) = @_; # Compute the emergency tracing file name. return "$FIG_Config::temp/Emergency$tkey.txt"; } =head3 EmergencyFileTarget my $fileName = Tracer::EmergencyFileTarget($tkey); Return the emergency tracing target file name. This is the file that receives the tracing output for file-based tracing. =over 4 =item tkey Tracing key for the current program. =item RETURN Returns the name of the file to contain the trace output. =back =cut sub EmergencyFileTarget { # Get the parameters. my ($tkey) = @_; # Compute the emergency tracing file name. return "$FIG_Config::temp/trace$tkey.log"; } =head3 EmergencyTracingDest my $dest = Tracer::EmergencyTracingDest($tkey, $myDest); This method converts an emergency tracing destination to a real tracing destination. The main difference is that if the destination is C<FILE> or C<APPEND>, we convert it to file output. If the destination is C<DUAL>, we convert it to file and standard output. =over 4 =item tkey Tracing key for this environment. =item myDest Destination from the emergency tracing file. =item RETURN Returns a destination that can be passed into L</TSetup>. =back =cut sub EmergencyTracingDest { # Get the parameters. my ($tkey, $myDest) = @_; # Declare the return variable. my $retVal = $myDest; # Process according to the destination value. if ($myDest eq 'FILE') { $retVal = ">" . EmergencyFileTarget($tkey); } elsif ($myDest eq 'APPEND') { $retVal = ">>" . EmergencyFileTarget($tkey); } elsif ($myDest eq 'DUAL') { $retVal = "+>" . EmergencyFileTarget($tkey); } # Return the result. return $retVal; } =head3 Emergency Emergency($key, $hours, $dest, $level, @modules); Turn on emergency tracing. This method is normally invoked over the web from a debugging console, but it can also be called by the C<trace.pl> script. The caller specifies the duration of the emergency in hours, the desired tracing destination, the trace level, and a list of the trace modules to activate. For the length of the duration, when a program in an environment with the specified tracing key active invokes a Sprout CGI script, tracing will be turned on automatically. See L</TSetup> for more about tracing setup and L</ETracing> for more about emergency tracing. =over 4 =item tkey The tracing key. This is used to identify the control file and the trace file. =item hours Number of hours to keep emergency tracing alive. =item dest Tracing destination. If no path information is specified for a file destination, it is put in the FIG temporary directory. =item level Tracing level. A higher level means more trace messages. =item modules A list of the tracing modules to activate. =back =cut sub Emergency { # Get the parameters. my ($tkey, $hours, $dest, $level, @modules) = @_; # Create the emergency file. my $specFile = EmergencyFileName($tkey); my $outHandle = Open(undef, ">$specFile"); print $outHandle join("\n", $hours, $dest, $level, @modules, ""); } =head3 EmergencyKey my $tkey = EmergencyKey($parameter); Return the Key to be used for emergency tracing. This could be an IP address, a session ID, or a user name, depending on the environment. =over 4 =item parameter Parameter defining the method for finding the tracing key. If it is a scalar, then it is presumed to be the tracing key itself. If it is a CGI object, then the tracing key is taken from the C<IP> cookie. Otherwise, the tracing key is taken from the C<TRACING> environment variable. =item RETURN Returns the key to be used for labels in emergency tracing. =back =cut sub EmergencyKey { # Get the parameters. my ($parameter) = @_; # Declare the return variable. my $retVal; # Determine the parameter type. if (! defined $parameter) { # Here we're supposed to check the environment. $retVal = $ENV{TRACING}; } else { my $ptype = ref $parameter; if ($ptype eq 'CGI') { # Here we were invoked from a web page. Look for a cookie. $retVal = $parameter->cookie('IP'); } elsif (! $ptype) { # Here the key was passed in. $retVal = $parameter; } } # If no luck finding a key, use the PID. if (! defined $retVal) { $retVal = $$; } # Return the result. return $retVal; } =head3 TraceParms Tracer::TraceParms($cgi); Trace the CGI parameters at trace level CGI => 3 and the environment variables at level CGI => 4. A self-referencing URL is traced at level CGI => 2. =over 4 =item cgi CGI query object containing the parameters to trace. =back =cut sub TraceParms { # Get the parameters. my ($cgi) = @_; if (T(CGI => 2)) { # Here we trace the GET-style URL for the script. Trace("URL: " . $cgi->url(-relative => 1, -query => 1)); } if (T(CGI => 3)) { # Here we want to trace the parameter data. my @names = $cgi->param; for my $parmName (sort @names) { # Note we skip the Trace parameters, which are for our use only. if ($parmName ne 'Trace' && $parmName ne 'TF') { my @values = $cgi->param($parmName); Trace("CGI: $parmName = " . join(", ", @values)); } } # Display the request method. my $method = $cgi->request_method(); Trace("Method: $method"); } if (T(CGI => 4)) { # Here we want the environment data too. for my $envName (sort keys %ENV) { Trace("ENV: $envName = $ENV{$envName}"); } } } =head3 TraceImages Tracer::TraceImages($htmlString); Trace information about all of an html document's images. The tracing will be for type "IMG" at level 3. The image's source string will be displayed. This is generally either the URL of the image or raw data for the image itself. If the source is too long, only the first 300 characters will be shown at trace level 3. The entire source will be shown, however, at trace level 4. This method is not very smart, and might catch Javascript code, but it is still useful when debugging the arcane behavior of images in multiple browser environments. =over 4 =item htmlString HTML text for an outgoing web page. =back =cut sub TraceImages { # Only proceed if we're at the proper trace level. if (T(IMG => 3)) { # For performance reasons we're manipulating $_[0] instead of retrieving the string # into a variable called "$htmlString". This is because we expect html strings to be # long, and don't want to copy them any more than we have to. Trace(length($_[0]) . " characters in web page."); # Loop through the HTML, culling image tags. while ($_[0] =~ /<img\s+[^>]+?src="([^"]+)"/sgi) { # Extract the source string and determine whether or not it's too long. my $srcString = $1; my $pos = pos($_[0]) - length($srcString); my $excess = length($srcString) - 300; # We'll put the display string in here. my $srcDisplay = $srcString; # If it's a data string, split it at the comma. $srcDisplay =~ s/^(data[^,]+,)/$1\n/; # If there's no excess or we're at trace level 4, we're done. At level 3 with # a long string, however, we only show the first 300 characters. if ($excess > 0 && ! T(IMG => 4)) { $srcDisplay = substr($srcDisplay,0,300) . "\nplus $excess characters."; } # Output the trace message. Trace("Image tag at position $pos:\n$srcDisplay"); } } } =head3 ScriptFinish ScriptFinish($webData, $varHash); Output a web page at the end of a script. Either the string to be output or the name of a template file can be specified. If the second parameter is omitted, it is assumed we have a string to be output; otherwise, it is assumed we have the name of a template file. The template should have the variable C<DebugData> specified in any form that invokes a standard script. If debugging mode is turned on, a form field will be put in that allows the user to enter tracing data. Trace messages will be placed immediately before the terminal C<BODY> tag in the output, formatted as a list. A typical standard script would loook like the following. BEGIN { # Print the HTML header. print "CONTENT-TYPE: text/html\n\n"; } use Tracer; use CGI; use FIG; # ... more uses ... my ($cgi, $varHash) = ScriptSetup(); eval { # ... get data from $cgi, put it in $varHash ... }; if ($@) { Trace("Script Error: $@") if T(0); } ScriptFinish("Html/MyTemplate.html", $varHash); The idea here is that even if the script fails, you'll see trace messages and useful output. =over 4 =item webData A string containing either the full web page to be written to the output or the name of a template file from which the page is to be constructed. If the name of a template file is specified, then the second parameter must be present; otherwise, it must be absent. =item varHash (optional) If specified, then a reference to a hash mapping variable names for a template to their values. The template file will be read into memory, and variable markers will be replaced by data in this hash reference. =back =cut sub ScriptFinish { # Get the parameters. my ($webData, $varHash) = @_; # Check for a template file situation. my $outputString; if (defined $varHash) { # Here we have a template file. We need to determine the template type. my $template; if ($FIG_Config::template_url && $webData =~ /\.php$/) { $template = "$FIG_Config::template_url/$webData"; } else { $template = "<<$webData"; } $outputString = PageBuilder::Build($template, $varHash, "Html"); } else { # Here the user gave us a raw string. $outputString = $webData; } # Check for trace messages. if ($Destination ne "NONE" && $TraceLevel > 0) { # We have trace messages, so we want to put them at the end of the body. This # is either at the end of the whole string or at the beginning of the BODY # end-tag. my $pos = length $outputString; if ($outputString =~ m#</body>#gi) { $pos = (pos $outputString) - 7; } # If the trace messages were queued, we unroll them. Otherwise, we display the # destination. my $traceHtml; if ($Destination eq "QUEUE") { $traceHtml = QTrace('Html'); } elsif ($Destination =~ /^>>(.+)$/) { # Here the tracing output it to a file. We code it as a hyperlink so the user # can copy the file name into the clipboard easily. my $actualDest = $1; $traceHtml = "<p>Tracing output to $actualDest.</p>\n"; } else { # Here we have one of the special destinations. $traceHtml = "<P>Tracing output type is $Destination.</p>\n"; } substr $outputString, $pos, 0, $traceHtml; } # Write the output string. print $outputString; } =head2 Command-Line Utility Methods =head3 SendSMS my $msgID = Tracer::SendSMS($phoneNumber, $msg); Send a text message to a phone number using Clickatell. The FIG_Config file must contain the user name, password, and API ID for the relevant account in the hash reference variable I<$FIG_Config::phone>, using the keys C<user>, C<password>, and C<api_id>. For example, if the user name is C<BruceTheHumanPet>, the password is C<silly>, and the API ID is C<2561022>, then the FIG_Config file must contain $phone = { user => 'BruceTheHumanPet', password => 'silly', api_id => '2561022' }; The original purpose of this method was to insure Bruce would be notified immediately when the Sprout Load terminates. Care should be taken if you do not wish Bruce to be notified immediately when you call this method. The message ID will be returned if successful, and C<undef> if an error occurs. =over 4 =item phoneNumber Phone number to receive the message, in international format. A United States phone number would be prefixed by "1". A British phone number would be prefixed by "44". =item msg Message to send to the specified phone. =item RETURN Returns the message ID if successful, and C<undef> if the message could not be sent. =back =cut sub SendSMS { # Get the parameters. my ($phoneNumber, $msg) = @_; # Declare the return variable. If we do not change it, C<undef> will be returned. my $retVal; # Only proceed if we have phone support. if (! defined $FIG_Config::phone) { Trace("Phone support not present in FIG_Config.") if T(1); } else { # Get the phone data. my $parms = $FIG_Config::phone; # Get the Clickatell URL. my $url = "http://api.clickatell.com/http/"; # Create the user agent. my $ua = LWP::UserAgent->new; # Request a Clickatell session. my $resp = $ua->post("$url/sendmsg", { user => $parms->{user}, password => $parms->{password}, api_id => $parms->{api_id}, to => $phoneNumber, text => $msg}); # Check for an error. if (! $resp->is_success) { Trace("Alert failed.") if T(1); } else { # Get the message ID. my $rstring = $resp->content; if ($rstring =~ /^ID:\s+(.*)$/) { $retVal = $1; } else { Trace("Phone attempt failed with $rstring") if T(1); } } } # Return the result. return $retVal; } =head3 StandardSetup my ($options, @parameters) = StandardSetup(\@categories, \%options, $parmHelp, @ARGV); This method performs standard command-line parsing and tracing setup. The return values are a hash of the command-line options and a list of the positional parameters. Tracing is automatically set up and the command-line options are validated. This is a complex method that does a lot of grunt work. The parameters can be more easily understood, however, once they are examined individually. The I<categories> parameter is the most obtuse. It is a reference to a list of special-purpose tracing categories. Most tracing categories are PERL package names. So, for example, if you wanted to turn on tracing inside the B<Sprout>, B<ERDB>, and B<SproutLoad> packages, you would specify the categories ["Sprout", "SproutLoad", "ERDB"] This would cause trace messages in the specified three packages to appear in the output. There are two special tracing categories that are automatically handled by this method. In other words, if you used L</TSetup> you would need to include these categories manually, but if you use this method they are turned on automatically. =over 4 =item SQL Traces SQL commands and activity. =item Tracer Traces error messages and call stacks. =back C<SQL> is only turned on if the C<-sql> option is specified in the command line. The trace level is specified using the C<-trace> command-line option. For example, the following command line for C<TransactFeatures> turns on SQL tracing and runs all tracing at level 3. TransactFeatures -trace=3 -sql register ../xacts IDs.tbl Standard tracing is output to the standard output and echoed to the file C<trace>I<$$>C<.log> in the FIG temporary directory, where I<$$> is the process ID. You can also specify the C<user> parameter to put a user ID instead of a process ID in the trace file name. So, for example The default trace level is 2. To get all messages, specify a trace level of 4. For a genome-by-genome update, use 3. TransactFeatures -trace=3 -sql -user=Bruce register ../xacts IDs.tbl would send the trace output to C<traceBruce.log> in the temporary directory. The I<options> parameter is a reference to a hash containing the command-line options, their default values, and an explanation of what they mean. Command-line options may be in the form of switches or keywords. In the case of a switch, the option value is 1 if it is specified and 0 if it is not specified. In the case of a keyword, the value is separated from the option name by an equal sign. You can see this last in the command-line example above. You can specify a different default trace level by setting C<$options->{trace}> prior to calling this method. An example at this point would help. Consider, for example, the command-line utility C<TransactFeatures>. It accepts a list of positional parameters plus the options C<safe>, C<noAlias>, C<start>, and C<tblFiles>. To start up this command, we execute the following code. my ($options, @parameters) = Tracer::StandardSetup(["DocUtils"], { safe => [0, "use database transactions"], noAlias => [0, "do not expect aliases in CHANGE transactions"], start => [' ', "start with this genome"], tblFiles => [0, "output TBL files containing the corrected IDs"] }, "<command> <transactionDirectory> <IDfile>", @ARGV); The call to C<ParseCommand> specifies the default values for the options and stores the actual options in a hash that is returned as C<$options>. The positional parameters are returned in C<@parameters>. The following is a sample command line for C<TransactFeatures>. TransactFeatures -trace=2 -noAlias register ../xacts IDs.tbl Single and double hyphens are equivalent. So, you could also code the above command as TransactFeatures --trace=2 --noAlias register ../xacts IDs.tbl In this case, C<register>, C<../xacts>, and C<IDs.tbl> are the positional parameters, and would find themselves in I<@parameters> after executing the above code fragment. The tracing would be set to level 2, and the categories would be C<Tracer>, and <DocUtils>. C<Tracer> is standard, and C<DocUtils> was included because it came in within the first parameter to this method. The I<$options> hash would be { trace => 2, sql => 0, safe => 0, noAlias => 1, start => ' ', tblFiles => 0 } Use of C<StandardSetup> in this way provides a simple way of performing standard tracing setup and command-line parsing. Note that the caller is not even aware of the command-line switches C<-trace> and C<-sql>, which are used by this method to control the tracing. If additional tracing features need to be added in the future, they can be processed by this method without upsetting the command-line utilities. If the C<background> option is specified on the command line, then the standard and error outputs will be directed to files in the temporary directory, using the same suffix as the trace file. So, if the command line specified -user=Bruce -background then the trace output would go to C<traceBruce.log>, the standard output to C<outBruce.log>, and the error output to C<errBruce.log>. This is designed to simplify starting a command in the background. The user name is also used as the tracing key for L</Emergency Tracing>. Specifying a value of C<E> for the trace level causes emergency tracing to be used instead of custom tracing. If the user name is not specified, the tracing key is taken from the C<Tracing> environment variable. If there is no value for that variable, the tracing key will be computed from the PID. Finally, if the special option C<-help> is specified, the option names will be traced at level 0 and the program will exit without processing. This provides a limited help capability. For example, if the user enters TransactFeatures -help he would see the following output. TransactFeatures [options] <command> <transactionDirectory> <IDfile> -trace tracing level (default E) -sql trace SQL commands -safe use database transactions -noAlias do not expect aliases in CHANGE transactions -start start with this genome -tblFiles output TBL files containing the corrected IDs The caller has the option of modifying the tracing scheme by placing a value for C<trace> in the incoming options hash. The default value can be overridden, or the tracing to the standard output can be turned off by suffixing a minus sign to the trace level. So, for example, { trace => [0, "tracing level (default 0)"], ... would set the default trace level to 0 instead of E, while { trace => ["2-", "tracing level (default 2)"], ... would set the default to 2, but trace only to the log file, not to the standard output. The parameters to this method are as follows. =over 4 =item categories Reference to a list of tracing category names. These should be names of packages whose internal workings will need to be debugged to get the command working. =item options Reference to a hash containing the legal options for the current command mapped to their default values and descriptions. The user can override the defaults by specifying the options as command-line switches prefixed by a hyphen. Tracing-related options may be added to this hash. If the C<-h> option is specified on the command line, the option descriptions will be used to explain the options. To turn off tracing to the standard output, add a minus sign to the value for C<trace> (see above). =item parmHelp A string that vaguely describes the positional parameters. This is used if the user specifies the C<-h> option. =item argv List of command line parameters, including the option switches, which must precede the positional parameters and be prefixed by a hyphen. =item RETURN Returns a list. The first element of the list is the reference to a hash that maps the command-line option switches to their values. These will either be the default values or overrides specified on the command line. The remaining elements of the list are the position parameters, in order. =back =cut sub StandardSetup { # Get the parameters. my ($categories, $options, $parmHelp, @argv) = @_; # Get the default tracing key. my $tkey = EmergencyKey(); # Add the tracing options. if (! exists $options->{trace}) { $options->{trace} = ['2', "tracing level (E for emergency tracing)"]; } $options->{sql} = [0, "turn on SQL tracing"]; $options->{help} = [0, "display command-line options"]; $options->{user} = [$tkey, "tracing key"]; $options->{background} = [0, "spool standard and error output"]; # Create a parsing hash from the options hash. The parsing hash # contains the default values rather than the default value # and the description. While we're at it, we'll memorize the # length of the longest option name. my $longestName = 0; my %parseOptions = (); for my $key (keys %{$options}) { if (length $key > $longestName) { $longestName = length $key; } $parseOptions{$key} = $options->{$key}->[0]; } # Parse the command line. my ($retOptions, @retParameters) = ParseCommand(\%parseOptions, @argv); # Get the logfile suffix. my $suffix = $retOptions->{user}; # Check for background mode. if ($retOptions->{background}) { my $outFileName = "$FIG_Config::temp/out$suffix.log"; my $errFileName = "$FIG_Config::temp/err$suffix.log"; open STDOUT, ">$outFileName"; open STDERR, ">$errFileName"; # Check for phone support. If we have phone support and a phone number, # we want to turn it on. if ($ENV{PHONE} && defined($FIG_Config::phone)) { $retOptions->{phone} = $ENV{PHONE}; } } # Now we want to set up tracing. First, we need to know if the user # wants emergency tracing. if ($retOptions->{trace} eq 'E') { ETracing($retOptions->{user}); } else { # Here the tracing is controlled from the command line. my @cats = @{$categories}; if ($retOptions->{sql}) { push @cats, "SQL"; } # Add the default categories. push @cats, "Tracer"; # Next, we create the category string by joining the categories. my $cats = join(" ", @cats); # Check to determine whether or not the caller wants to turn off tracing # to the standard output. my $traceLevel = $retOptions->{trace}; my $textOKFlag = 1; if ($traceLevel =~ /^(.)-/) { $traceLevel = $1; $textOKFlag = 0; } # Now we set up the trace mode. my $traceMode; # Verify that we can open a file in the FIG temporary directory. my $traceFileName = "$FIG_Config::temp/trace$suffix.log"; if (open TESTTRACE, ">$traceFileName") { # Here we can trace to a file. $traceMode = ">$traceFileName"; if ($textOKFlag) { # Echo to standard output if the text-OK flag is set. $traceMode = "+$traceMode"; } # Close the test file. close TESTTRACE; } else { # Here we can't trace to a file. We trace to the standard output if it's # okay, and the error log otherwise. if ($textOKFlag) { $traceMode = "TEXT"; } else { $traceMode = "WARN"; } } # Now set up the tracing. TSetup("$traceLevel $cats", $traceMode); } # Check for the "help" option. If it is specified, dump the command-line # options and exit the program. if ($retOptions->{help}) { $0 =~ m#[/\\](\w+)(\.pl)?$#i; print "$1 [options] $parmHelp\n"; for my $key (sort keys %{$options}) { my $name = Pad($key, $longestName, 0, ' '); my $desc = $options->{$key}->[1]; if ($options->{$key}->[0]) { $desc .= " (default " . $options->{$key}->[0] . ")"; } print " $name $desc\n"; } exit(0); } # Trace the options, if applicable. if (T(3)) { my @parms = grep { $retOptions->{$_} } keys %{$retOptions}; Trace("Selected options: " . join(", ", sort @parms) . "."); } # Return the parsed parameters. return ($retOptions, @retParameters); } =head3 ReadOptions my %options = Tracer::ReadOptions($fileName); Read a set of options from a file. Each option is encoded in a line of text that has the format I<optionName>C<=>I<optionValue>C<; >I<comment> The option name must consist entirely of letters, digits, and the punctuation characters C<.> and C<_>, and is case sensitive. Blank lines and lines in which the first nonblank character is a semi-colon will be ignored. The return hash will map each option name to the corresponding option value. =over 4 =item fileName Name of the file containing the option data. =item RETURN Returns a hash mapping the option names specified in the file to their corresponding option value. =back =cut sub ReadOptions { # Get the parameters. my ($fileName) = @_; # Open the file. (open CONFIGFILE, "<$fileName") || Confess("Could not open option file $fileName."); # Count the number of records read. my ($records, $comments) = 0; # Create the return hash. my %retVal = (); # Loop through the file, accumulating key-value pairs. while (my $line = <CONFIGFILE>) { # Denote we've read a line. $records++; # Determine the line type. if ($line =~ /^\s*[\n\r]/) { # A blank line is a comment. $comments++; } elsif ($line =~ /^\s*([A-Za-z0-9_\.]+)=([^;]*);/) { # Here we have an option assignment. retVal{$1} = $2; } elsif ($line =~ /^\s*;/) { # Here we have a text comment. $comments++; } else { # Here we have an invalid line. Trace("Invalid option statement in record $records.") if T(0); } } # Return the hash created. return %retVal; } =head3 GetOptions Tracer::GetOptions(\%defaults, \%options); Merge a specified set of options into a table of defaults. This method takes two hash references as input and uses the data from the second to update the first. If the second does not exist, there will be no effect. An error will be thrown if one of the entries in the second hash does not exist in the first. Consider the following example. my $optionTable = GetOptions({ dbType => 'mySQL', trace => 0 }, $options); In this example, the variable B<$options> is expected to contain at most two options-- B<dbType> and B<trace>. The default database type is C<mySQL> and the default trace level is C<0>. If the value of B<$options> is C<< {dbType => 'Oracle'} >>, then the database type will be changed to C<Oracle> and the trace level will remain at 0. If B<$options> is undefined, then the database type and trace level will remain C<mySQL> and C<0>. If, on the other hand, B<$options> is defined as {databaseType => 'Oracle'} an error will occur because the B<databaseType> option does not exist. =over 4 =item defaults Table of default option values. =item options Table of overrides, if any. =item RETURN Returns a reference to the default table passed in as the first parameter. =back =cut sub GetOptions { # Get the parameters. my ($defaults, $options) = @_; # Check for overrides. if ($options) { # Loop through the overrides. while (my ($option, $setting) = each %{$options}) { # Insure this override exists. if (!exists $defaults->{$option}) { croak "Unrecognized option $option encountered."; } else { # Apply the override. $defaults->{$option} = $setting; } } } # Return the merged table. return $defaults; } =head3 MergeOptions Tracer::MergeOptions(\%table, \%defaults); Merge default values into a hash table. This method looks at the key-value pairs in the second (default) hash, and if a matching key is not found in the first hash, the default pair is copied in. The process is similar to L</GetOptions>, but there is no error- checking and no return value. =over 4 =item table Hash table to be updated with the default values. =item defaults Default values to be merged into the first hash table if they are not already present. =back =cut sub MergeOptions { # Get the parameters. my ($table, $defaults) = @_; # Loop through the defaults. while (my ($key, $value) = each %{$defaults}) { if (!exists $table->{$key}) { $table->{$key} = $value; } } } =head3 ParseCommand my ($options, @arguments) = Tracer::ParseCommand(\%optionTable, @inputList); Parse a command line consisting of a list of parameters. The initial parameters may be option specifiers of the form C<->I<option> or C<->I<option>C<=>I<value>. The options are stripped off and merged into a table of default options. The remainder of the command line is returned as a list of positional arguments. For example, consider the following invocation. my ($options, @arguments) = ParseCommand({ errors => 0, logFile => 'trace.log'}, @words); In this case, the list @words will be treated as a command line and there are two options available, B<errors> and B<logFile>. If @words has the following format -logFile=error.log apple orange rutabaga then at the end of the invocation, C<$options> will be { errors => 0, logFile => 'error.log' } and C<@arguments> will contain apple orange rutabaga The parser allows for some escape sequences. See L</UnEscape> for a description. There is no support for quote characters. Options can be specified with single or double hyphens. =over 4 =item optionTable Table of default options. =item inputList List of words on the command line. =item RETURN Returns a reference to the option table and a list of the positional arguments. =back =cut sub ParseCommand { # Get the parameters. my ($optionTable, @inputList) = @_; # Process any options in the input list. my %overrides = (); while ((@inputList > 0) && ($inputList[0] =~ /^--?/)) { # Get the current option. my $arg = shift @inputList; # Pull out the option name. $arg =~ /^--?([^=]*)/g; my $name = $1; # Check for an option value. if ($arg =~ /\G=(.*)$/g) { # Here we have a value for the option. $overrides{$name} = UnEscape($1); } else { # Here there is no value, so we use 1. $overrides{$name} = 1; } } # Merge the options into the defaults. GetOptions($optionTable, \%overrides); # Translate the remaining parameters. my @retVal = (); for my $inputParm (@inputList) { push @retVal, UnEscape($inputParm); } # Return the results. return ($optionTable, @retVal); } =head2 File Utility Methods =head3 GetFile my @fileContents = Tracer::GetFile($fileName); or my $fileContents = Tracer::GetFile($fileName); Return the entire contents of a file. In list context, line-ends are removed and each line is a list element. In scalar context, line-ends are replaced by C<\n>. =over 4 =item fileName Name of the file to read. =item RETURN In a list context, returns the entire file as a list with the line terminators removed. In a scalar context, returns the entire file as a string. If an error occurs opening the file, an empty list will be returned. =back =cut sub GetFile { # Get the parameters. my ($fileName) = @_; # Declare the return variable. my @retVal = (); # Open the file for input. my $handle = Open(undef, "<$fileName"); # Read the whole file into the return variable, stripping off any terminator # characters. my $lineCount = 0; while (my $line = <$handle>) { $lineCount++; $line = Strip($line); push @retVal, $line; } # Close it. close $handle; my $actualLines = @retVal; Trace("$actualLines lines read from file $fileName.") if T(File => 2); # Return the file's contents in the desired format. if (wantarray) { return @retVal; } else { return join "\n", @retVal; } } =head3 PutFile Tracer::PutFile($fileName, \@lines); Write out a file from a list of lines of text. =over 4 =item fileName Name of the output file. =item lines Reference to a list of text lines. The lines will be written to the file in order, with trailing new-line characters. Alternatively, may be a string, in which case the string will be written without modification. =back =cut sub PutFile { # Get the parameters. my ($fileName, $lines) = @_; # Open the output file. my $handle = Open(undef, ">$fileName"); # Count the lines written. if (ref $lines ne 'ARRAY') { # Here we have a scalar, so we write it raw. print $handle $lines; Trace("Scalar put to file $fileName.") if T(File => 3); } else { # Write the lines one at a time. my $count = 0; for my $line (@{$lines}) { print $handle "$line\n"; $count++; } Trace("$count lines put to file $fileName.") if T(File => 3); } # Close the output file. close $handle; } =head3 ParseRecord my @fields = Tracer::ParseRecord($line); Parse a tab-delimited data line. The data line is split into field values. Embedded tab and new-line characters in the data line must be represented as C<\t> and C<\n>, respectively. These will automatically be converted. =over 4 =item line Line of data containing the tab-delimited fields. =item RETURN Returns a list of the fields found in the data line. =back =cut sub ParseRecord { # Get the parameter. my ($line) = @_; # Remove the trailing new-line, if any. chomp $line; # Split the line read into pieces using the tab character. my @retVal = split /\t/, $line; # Trim and fix the escapes in each piece. for my $value (@retVal) { # Trim leading whitespace. $value =~ s/^\s+//; # Trim trailing whitespace. $value =~ s/\s+$//; # Delete the carriage returns. $value =~ s/\r//g; # Convert the escapes into their real values. $value =~ s/\\t/"\t"/ge; $value =~ s/\\n/"\n"/ge; } # Return the result. return @retVal; } =head3 Merge my @mergedList = Tracer::Merge(@inputList); Sort a list of strings and remove duplicates. =over 4 =item inputList List of scalars to sort and merge. =item RETURN Returns a list containing the same elements sorted in ascending order with duplicates removed. =back =cut sub Merge { # Get the input list in sort order. my @inputList = sort @_; # Only proceed if the list has at least two elements. if (@inputList > 1) { # Now we want to move through the list splicing out duplicates. my $i = 0; while ($i < @inputList) { # Get the current entry. my $thisEntry = $inputList[$i]; # Find out how many elements duplicate the current entry. my $j = $i + 1; my $dup1 = $i + 1; while ($j < @inputList && $inputList[$j] eq $thisEntry) { $j++; }; # If the number is nonzero, splice out the duplicates found. if ($j > $dup1) { splice @inputList, $dup1, $j - $dup1; } # Now the element at position $dup1 is different from the element before it # at position $i. We push $i forward one position and start again. $i++; } } # Return the merged list. return @inputList; } =head3 Open my $handle = Open($fileHandle, $fileSpec, $message); Open a file. The I<$fileSpec> is essentially the second argument of the PERL C<open> function. The mode is specified using Unix-like shell information. So, for example, Open(\*LOGFILE, '>>/usr/spool/news/twitlog', "Could not open twit log."); would open for output appended to the specified file, and Open(\*DATASTREAM, "| sort -u >$outputFile", "Could not open $outputFile."); would open a pipe that sorts the records written and removes duplicates. Note the use of file handle syntax in the Open call. To use anonymous file handles, code as follows. my $logFile = Open(undef, '>>/usr/spool/news/twitlog', "Could not open twit log."); The I<$message> parameter is used if the open fails. If it is set to C<0>, then the open returns TRUE if successful and FALSE if an error occurred. Otherwise, a failed open will throw an exception and the third parameter will be used to construct an error message. If the parameter is omitted, a standard message is constructed using the file spec. Could not open "/usr/spool/news/twitlog" Note that the mode characters are automatically cleaned from the file name. The actual error message from the file system will be captured and appended to the message in any case. Could not open "/usr/spool/news/twitlog": file not found. In some versions of PERL the only error message we get is a number, which corresponds to the C++ C<errno> value. Could not open "/usr/spool/news/twitlog": 6. =over 4 =item fileHandle File handle. If this parameter is C<undef>, a file handle will be generated and returned as the value of this method. =item fileSpec File name and mode, as per the PERL C<open> function. =item message (optional) Error message to use if the open fails. If omitted, a standard error message will be generated. In either case, the error information from the file system is appended to the message. To specify a conditional open that does not throw an error if it fails, use C<0>. =item RETURN Returns the name of the file handle assigned to the file, or C<undef> if the open failed. =back =cut sub Open { # Get the parameters. my ($fileHandle, $fileSpec, $message) = @_; # Attempt to open the file. my $rv = open $fileHandle, $fileSpec; # If the open failed, generate an error message. if (! $rv) { # Save the system error message. my $sysMessage = $!; # See if we need a default message. if (!$message) { # Clean any obvious mode characters and leading spaces from the # filename. my ($fileName) = FindNamePart($fileSpec); $message = "Could not open \"$fileName\""; } # Terminate with an error using the supplied message and the # error message from the file system. Confess("$message: $!"); } # Return the file handle. return $fileHandle; } =head3 FindNamePart my ($fileName, $start, $len) = Tracer::FindNamePart($fileSpec); Extract the portion of a file specification that contains the file name. A file specification is the string passed to an C<open> call. It specifies the file mode and name. In a truly complex situation, it can specify a pipe sequence. This method assumes that the file name is whatever follows the first angle bracket sequence. So, for example, in the following strings the file name is C</usr/fig/myfile.txt>. >>/usr/fig/myfile.txt </usr/fig/myfile.txt | sort -u > /usr/fig/myfile.txt If the method cannot find a file name using its normal methods, it will return the whole incoming string. =over 4 =item fileSpec File specification string from which the file name is to be extracted. =item RETURN Returns a three-element list. The first element contains the file name portion of the specified string, or the whole string if a file name cannot be found via normal methods. The second element contains the start position of the file name portion and the third element contains the length. =back =cut #: Return Type $; sub FindNamePart { # Get the parameters. my ($fileSpec) = @_; # Default to the whole input string. my ($retVal, $pos, $len) = ($fileSpec, 0, length $fileSpec); # Parse out the file name if we can. if ($fileSpec =~ m/(<|>>?)(.+?)(\s*)$/) { $retVal = $2; $len = length $retVal; $pos = (length $fileSpec) - (length $3) - $len; } # Return the result. return ($retVal, $pos, $len); } =head3 OpenDir my @files = OpenDir($dirName, $filtered, $flag); Open a directory and return all the file names. This function essentially performs the functions of an C<opendir> and C<readdir>. If the I<$filtered> parameter is set to TRUE, all filenames beginning with a period (C<.>), dollar sign (C<$>), or pound sign (C<#>) and all filenames ending with a tilde C<~>) will be filtered out of the return list. If the directory does not open and I<$flag> is not set, an exception is thrown. So, for example, my @files = OpenDir("/Volumes/fig/contigs", 1); is effectively the same as opendir(TMP, "/Volumes/fig/contigs") || Confess("Could not open /Volumes/fig/contigs."); my @files = grep { $_ !~ /^[\.\$\#]/ && $_ !~ /~$/ } readdir(TMP); Similarly, the following code my @files = grep { $_ =~ /^\d/ } OpenDir("/Volumes/fig/orgs", 0, 1); Returns the names of all files in C</Volumes/fig/orgs> that begin with digits and automatically returns an empty list if the directory fails to open. =over 4 =item dirName Name of the directory to open. =item filtered TRUE if files whose names begin with a period (C<.>) should be automatically removed from the list, else FALSE. =item flag TRUE if a failure to open is okay, else FALSE =back =cut #: Return Type @; sub OpenDir { # Get the parameters. my ($dirName, $filtered, $flag) = @_; # Declare the return variable. my @retVal = (); # Open the directory. if (opendir(my $dirHandle, $dirName)) { # The directory opened successfully. Get the appropriate list according to the # strictures of the filter parameter. if ($filtered) { @retVal = grep { $_ !~ /^[\.\$\#]/ && $_ !~ /~$/ } readdir $dirHandle; } else { @retVal = readdir $dirHandle; } } elsif (! $flag) { # Here the directory would not open and it's considered an error. Confess("Could not open directory $dirName."); } # Return the result. return @retVal; } =head3 Insure Insure($dirName, $chmod); Insure a directory is present. =over 4 =item dirName Name of the directory to check. If it does not exist, it will be created. =item chmod (optional) Security privileges to be given to the directory if it is created. =back =cut sub Insure { my ($dirName, $chmod) = @_; if (! -d $dirName) { Trace("Creating $dirName directory.") if T(2); eval { mkpath $dirName; # If we have permissions specified, set them here. if (defined($chmod)) { chmod $chmod, $dirName; } }; if ($@) { Confess("Error creating $dirName: $@"); } } } =head3 ChDir ChDir($dirName); Change to the specified directory. =over 4 =item dirName Name of the directory to which we want to change. =back =cut sub ChDir { my ($dirName) = @_; if (! -d $dirName) { Confess("Cannot change to directory $dirName: no such directory."); } else { Trace("Changing to directory $dirName.") if T(File => 4); my $okFlag = chdir $dirName; if (! $okFlag) { Confess("Error switching to directory $dirName."); } } } =head3 SetPermissions Tracer::SetPermissions($dirName, $group, $mask, %otherMasks); Set the permissions for a directory and all the files and folders inside it. In addition, the group ownership will be changed to the specified value. This method is more vulnerable than most to permission and compatability problems, so it does internal error recovery. =over 4 =item dirName Name of the directory to process. =item group Name of the group to be assigned. =item mask Permission mask. Bits that are C<1> in this mask will be ORed into the permission bits of any file or directory that does not already have them set to 1. =item otherMasks Map of search patterns to permission masks. If a directory name matches one of the patterns, that directory and all its members and subdirectories will be assigned the new pattern. For example, the following would assign 01664 to most files, but would use 01777 for directories named C<tmp>. Tracer::SetPermissions($dirName, 'fig', 01664, '^tmp$' => 01777); The list is ordered, so the following would use 0777 for C<tmp1> and 0666 for C<tmp>, C<tmp2>, or C<tmp3>. Tracer::SetPermissions($dirName, 'fig', 01664, '^tmp1' => 0777, '^tmp' => 0666); Note that the pattern matches are all case-insensitive, and only directory names are matched, not file names. =back =cut sub SetPermissions { # Get the parameters. my ($dirName, $group, $mask, @otherMasks) = @_; # Set up for error recovery. eval { # Switch to the specified directory. ChDir($dirName); # Get the group ID. my $gid = getgrnam($group); # Get the mask for tracing. my $traceMask = sprintf("%04o", $mask) . "($mask)"; Trace("Fixing permissions for directory $dirName using group $group($gid) and mask $traceMask.") if T(File => 2); my $fixCount = 0; my $lookCount = 0; # @dirs will be a stack of directories to be processed. my @dirs = (getcwd()); while (scalar(@dirs) > 0) { # Get the current directory. my $dir = pop @dirs; # Check for a match to one of the specified directory names. To do # that, we need to pull the individual part of the name off of the # whole path. my $simpleName = $dir; if ($dir =~ m!/([^/]+)$!) { $simpleName = $1; } Trace("Simple directory name for $dir is $simpleName.") if T(File => 4); # Search for a match. my $match = 0; my $i; for ($i = 0; $i < $#otherMasks && ! $match; $i += 2) { my $pattern = $otherMasks[$i]; if ($simpleName =~ /$pattern/i) { $match = 1; } } # Check for a match. Note we use $i-1 because the loop added 2 # before terminating due to the match. if ($match && $otherMasks[$i-1] != $mask) { # This directory matches one of the incoming patterns, and it's # a different mask, so we process it recursively with that mask. SetPermissions($dir, $group, $otherMasks[$i-1], @otherMasks); } else { # Here we can process normally. Get all of the non-hidden members. my @submems = OpenDir($dir, 1); for my $submem (@submems) { # Get the full name. my $thisMem = "$dir/$submem"; Trace("Checking member $thisMem.") if T(4); $lookCount++; if ($lookCount % 1000 == 0) { Trace("$lookCount members examined. Current is $thisMem. Mask is $traceMask") if T(File => 3); } # Fix the group. chown -1, $gid, $thisMem; # Insure this member is not a symlink. if (! -l $thisMem) { # Get its info. my $fileInfo = stat $thisMem; # Only proceed if we got the info. Otherwise, it's a hard link # and we want to skip it anyway. if ($fileInfo) { my $fileMode = $fileInfo->mode; if (($fileMode & $mask) != $mask) { # Fix this member. $fileMode |= $mask; chmod $fileMode, $thisMem; $fixCount++; } # If it's a subdirectory, stack it. if (-d $thisMem) { push @dirs, $thisMem; } } } } } } Trace("$lookCount files and directories processed, $fixCount fixed.") if T(File => 2); }; # Check for an error. if ($@) { Confess("SetPermissions error: $@"); } } =head3 GetLine my @data = Tracer::GetLine($handle); Read a line of data from a tab-delimited file. =over 4 =item handle Open file handle from which to read. =item RETURN Returns a list of the fields in the record read. The fields are presumed to be tab-delimited. If we are at the end of the file, then an empty list will be returned. If an empty line is read, a single list item consisting of a null string will be returned. =back =cut sub GetLine { # Get the parameters. my ($handle) = @_; # Declare the return variable. my @retVal = (); Trace("File position is " . tell($handle) . ". EOF flag is " . eof($handle) . ".") if T(File => 4); # Read from the file. my $line = <$handle>; # Only proceed if we found something. if (defined $line) { # Remove the new-line. We are a bit over-cautious here because the file may be coming in via an # upload control and have a nonstandard EOL combination. $line =~ s/(\r|\n)+$//; # Here we do some fancy tracing to help in debugging complicated EOL marks. if (T(File => 4)) { my $escapedLine = $line; $escapedLine =~ s/\n/\\n/g; $escapedLine =~ s/\r/\\r/g; $escapedLine =~ s/\t/\\t/g; Trace("Line read: -->$escapedLine<--"); } # If the line is empty, return a single empty string; otherwise, parse # it into fields. if ($line eq "") { push @retVal, ""; } else { push @retVal, split /\t/,$line; } } else { # Trace the reason the read failed. Trace("End of file: $!") if T(File => 3); } # Return the result. return @retVal; } =head3 PutLine Tracer::PutLine($handle, \@fields, $eol); Write a line of data to a tab-delimited file. The specified field values will be output in tab-separated form, with a trailing new-line. =over 4 =item handle Output file handle. =item fields List of field values. =item eol (optional) End-of-line character (default is "\n"). =back =cut sub PutLine { # Get the parameters. my ($handle, $fields, $eol) = @_; # Write the data. print $handle join("\t", @{$fields}) . ($eol || "\n"); } =head2 Other Useful Methods =head3 ParseParm my $listValue = Tracer::ParseParm($string); Convert a parameter into a list reference. If the parameter is undefined, an undefined value will be returned. Otherwise, it will be parsed as a comma-separated list of values. =over 4 =item string Incoming string. =item RETURN Returns a reference to a list of values, or C<undef> if the incoming value was undefined. =back =cut sub ParseParm { # Get the parameters. my ($string) = @_; # Declare the return variable. my $retVal; # Check for data. if (defined $string) { # We have some, so split it into a list. $retVal = [ split /\s*,\s*/, $string]; } # Return the result. return $retVal; } =head3 Now my $string = Tracer::Now(); Return a displayable time stamp containing the local time. =cut sub Now { my ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime(time); my $retVal = _p2($mon+1) . "/" . _p2($mday) . "/" . ($year + 1900) . " " . _p2($hour) . ":" . _p2($min) . ":" . _p2($sec); return $retVal; } # Pad a number to 2 digits. sub _p2 { my ($value) = @_; $value = "0$value" if ($value < 10); return $value; } =head3 Escape my $codedString = Tracer::Escape($realString); Escape a string for use in a command. Tabs will be replaced by C<\t>, new-lines replaced by C<\n>, carriage returns will be deleted, and backslashes will be doubled. The result is to reverse the effect of L</UnEscape>. =over 4 =item realString String to escape. =item RETURN Escaped equivalent of the real string. =back =cut sub Escape { # Get the parameter. my ($realString) = @_; # Initialize the return variable. my $retVal = ""; # Loop through the parameter string, looking for sequences to escape. while (length $realString > 0) { # Look for the first sequence to escape. if ($realString =~ /^(.*?)([\n\t\r\\])/) { # Here we found it. The text preceding the sequence is in $1. The sequence # itself is in $2. First, move the clear text to the return variable. $retVal .= $1; # Strip the processed section off the real string. $realString = substr $realString, (length $2) + (length $1); # Get the matched character. my $char = $2; # If we have a CR, we are done. if ($char ne "\r") { # It's not a CR, so encode the escape sequence. $char =~ tr/\t\n/tn/; $retVal .= "\\" . $char; } } else { # Here there are no more escape sequences. The rest of the string is # transferred unmodified. $retVal .= $realString; $realString = ""; } } # Return the result. return $retVal; } =head3 UnEscape my $realString = Tracer::UnEscape($codedString); Replace escape sequences with their actual equivalents. C<\t> will be replaced by a tab, C<\n> by a new-line character, and C<\\> by a backslash. C<\r> codes will be deleted. =over 4 =item codedString String to un-escape. =item RETURN Returns a copy of the original string with the escape sequences converted to their actual values. =back =cut sub UnEscape { # Get the parameter. my ($codedString) = @_; # Initialize the return variable. my $retVal = ""; # Only proceed if the incoming string is nonempty. if (defined $codedString) { # Loop through the parameter string, looking for escape sequences. We can't do # translating because it causes problems with the escaped slash. ("\\t" becomes # "\<tab>" no matter what we do.) while (length $codedString > 0) { # Look for the first escape sequence. if ($codedString =~ /^(.*?)\\(\\|n|t|r)/) { # Here we found it. The text preceding the sequence is in $1. The sequence # itself is in $2. First, move the clear text to the return variable. $retVal .= $1; $codedString = substr $codedString, (2 + length $1); # Get the escape value. my $char = $2; # If we have a "\r", we are done. if ($char ne 'r') { # Here it's not an 'r', so we convert it. $char =~ tr/\\tn/\\\t\n/; $retVal .= $char; } } else { # Here there are no more escape sequences. The rest of the string is # transferred unmodified. $retVal .= $codedString; $codedString = ""; } } } # Return the result. return $retVal; } =head3 Percent my $percent = Tracer::Percent($number, $base); Returns the percent of the base represented by the given number. If the base is zero, returns zero. =over 4 =item number Percent numerator. =item base Percent base. =item RETURN Returns the percentage of the base represented by the numerator. =back =cut sub Percent { # Get the parameters. my ($number, $base) = @_; # Declare the return variable. my $retVal = 0; # Compute the percent. if ($base != 0) { $retVal = $number * 100 / $base; } # Return the result. return $retVal; } =head3 Min my $min = Min($value1, $value2, ... $valueN); Return the minimum argument. The arguments are treated as numbers. =over 4 =item $value1, $value2, ... $valueN List of numbers to compare. =item RETURN Returns the lowest number in the list. =back =cut sub Min { # Get the parameters. Note that we prime the return value with the first parameter. my ($retVal, @values) = @_; # Loop through the remaining parameters, looking for the lowest. for my $value (@values) { if ($value < $retVal) { $retVal = $value; } } # Return the minimum found. return $retVal; } =head3 Max my $max = Max($value1, $value2, ... $valueN); Return the maximum argument. The arguments are treated as numbers. =over 4 =item $value1, $value2, ... $valueN List of numbers to compare. =item RETURN Returns the highest number in the list. =back =cut sub Max { # Get the parameters. Note that we prime the return value with the first parameter. my ($retVal, @values) = @_; # Loop through the remaining parameters, looking for the highest. for my $value (@values) { if ($value > $retVal) { $retVal = $value; } } # Return the maximum found. return $retVal; } =head3 DebugMode if (Tracer::DebugMode) { ...code... } Return TRUE if debug mode has been turned on, else abort. Certain CGI scripts are too dangerous to exist in the production environment. This method provides a simple way to prevent them from working unless they are explicitly turned on by creating a password cookie via the B<SetPassword> script. If debugging mode is not turned on, an error will occur. =cut sub DebugMode { # Declare the return variable. my $retVal = 0; # Check the debug configuration. my $password = CGI::cookie("DebugMode"); my $encrypted = Digest::MD5::md5_hex($password); if ($encrypted eq "252dec43280e0c0d6a75ffcec486e61d") { $retVal = 1; } else { # Here debug mode is off, so we generate an error. Confess("Cannot use this facility without logging in."); } # Return the determination indicator. return $retVal; } =head3 Strip my $string = Tracer::Strip($line); Strip all line terminators off a string. This is necessary when dealing with files that may have been transferred back and forth several times among different operating environments. =over 4 =item line Line of text to be stripped. =item RETURN The same line of text with all the line-ending characters chopped from the end. =back =cut sub Strip { # Get a copy of the parameter string. my ($string) = @_; my $retVal = (defined $string ? $string : ""); # Strip the line terminator characters. $retVal =~ s/(\r|\n)+$//g; # Return the result. return $retVal; } =head3 Pad my $paddedString = Tracer::Pad($string, $len, $left, $padChar); Pad a string to a specified length. The pad character will be a space, and the padding will be on the right side unless specified in the third parameter. =over 4 =item string String to be padded. =item len Desired length of the padded string. =item left (optional) TRUE if the string is to be left-padded; otherwise it will be padded on the right. =item padChar (optional) Character to use for padding. The default is a space. =item RETURN Returns a copy of the original string with the pad character added to the specified end so that it achieves the desired length. =back =cut sub Pad { # Get the parameters. my ($string, $len, $left, $padChar) = @_; # Compute the padding character. if (! defined $padChar) { $padChar = " "; } # Compute the number of spaces needed. my $needed = $len - length $string; # Copy the string into the return variable. my $retVal = $string; # Only proceed if padding is needed. if ($needed > 0) { # Create the pad string. my $pad = $padChar x $needed; # Affix it to the return value. if ($left) { $retVal = $pad . $retVal; } else { $retVal .= $pad; } } # Return the result. return $retVal; } =head3 EOF This is a constant that is lexically greater than any useful string. =cut sub EOF { return "\xFF\xFF\xFF\xFF\xFF"; } =head3 TICK my @results = TICK($commandString); Perform a back-tick operation on a command. If this is a Windows environment, any leading dot-slash (C<./> will be removed. So, for example, if you were doing `./protein.cgi` from inside a CGI script, it would work fine in Unix, but would issue an error message in Windows complaining that C<'.'> is not a valid command. If instead you code TICK("./protein.cgi") it will work correctly in both environments. =over 4 =item commandString The command string to pass to the system. =item RETURN Returns the standard output from the specified command, as a list. =back =cut #: Return Type @; sub TICK { # Get the parameters. my ($commandString) = @_; # Chop off the dot-slash if this is Windows. if ($FIG_Config::win_mode) { $commandString =~ s!^\./!!; } # Activate the command and return the result. return `$commandString`; } =head3 CommaFormat my $formatted = Tracer::CommaFormat($number); Insert commas into a number. =over 4 =item number A sequence of digits. =item RETURN Returns the same digits with commas strategically inserted. =back =cut sub CommaFormat { # Get the parameters. my ($number) = @_; # Pad the length up to a multiple of three. my $padded = "$number"; $padded = " " . $padded while length($padded) % 3 != 0; # This is a fancy PERL trick. The parentheses in the SPLIT pattern # cause the delimiters to be included in the output stream. The # GREP removes the empty strings in between the delimiters. my $retVal = join(",", grep { $_ ne '' } split(/(...)/, $padded)); # Clean out the spaces. $retVal =~ s/ //g; # Return the result. return $retVal; } =head3 CompareLists my ($inserted, $deleted) = Tracer::CompareLists(\@newList, \@oldList, $keyIndex); Compare two lists of tuples, and return a hash analyzing the differences. The lists are presumed to be sorted alphabetically by the value in the $keyIndex column. The return value contains a list of items that are only in the new list (inserted) and only in the old list (deleted). =over 4 =item newList Reference to a list of new tuples. =item oldList Reference to a list of old tuples. =item keyIndex (optional) Index into each tuple of its key field. The default is 0. =item RETURN Returns a 2-tuple consisting of a reference to the list of items that are only in the new list (inserted) followed by a reference to the list of items that are only in the old list (deleted). =back =cut sub CompareLists { # Get the parameters. my ($newList, $oldList, $keyIndex) = @_; if (! defined $keyIndex) { $keyIndex = 0; } # Declare the return variables. my ($inserted, $deleted) = ([], []); # Loop through the two lists simultaneously. my ($newI, $oldI) = (0, 0); my ($newN, $oldN) = (scalar @{$newList}, scalar @{$oldList}); while ($newI < $newN || $oldI < $oldN) { # Get the current object in each list. Note that if one # of the lists is past the end, we'll get undef. my $newItem = $newList->[$newI]; my $oldItem = $oldList->[$oldI]; if (! defined($newItem) || defined($oldItem) && $newItem->[$keyIndex] gt $oldItem->[$keyIndex]) { # The old item is not in the new list, so mark it deleted. push @{$deleted}, $oldItem; $oldI++; } elsif (! defined($oldItem) || $oldItem->[$keyIndex] gt $newItem->[$keyIndex]) { # The new item is not in the old list, so mark it inserted. push @{$inserted}, $newItem; $newI++; } else { # The item is in both lists, so push forward. $oldI++; $newI++; } } # Return the result. return ($inserted, $deleted); } =head3 GenerateURL my $queryUrl = Tracer::GenerateURL($page, %parameters); Generate a GET-style URL for the specified page with the specified parameter names and values. The values will be URL-escaped automatically. So, for example Tracer::GenerateURL("form.cgi", type => 1, string => "\"high pass\" or highway") would return form.cgi?type=1;string=%22high%20pass%22%20or%20highway =over 4 =item page Page URL. =item parameters Hash mapping parameter names to parameter values. =item RETURN Returns a GET-style URL that goes to the specified page and passes in the specified parameters and values. =back =cut sub GenerateURL { # Get the parameters. my ($page, %parameters) = @_; # Prime the return variable with the page URL. my $retVal = $page; # Loop through the parameters, creating parameter elements in a list. my @parmList = map { "$_=" . uri_escape($parameters{$_}) } keys %parameters; # If the list is nonempty, tack it on. if (@parmList) { $retVal .= "?" . join(";", @parmList); } # Return the result. return $retVal; } =head3 ApplyURL Tracer::ApplyURL($table, $target, $url); Run through a two-dimensional table (or more accurately, a list of lists), converting the I<$target> column to HTML text having a hyperlink to a URL in the I<$url> column. The URL column will be deleted by this process and the target column will be HTML-escaped. This provides a simple way to process the results of a database query into something displayable by combining a URL with text. =over 4 =item table Reference to a list of lists. The elements in the containing list will be updated by this method. =item target The index of the column to be converted into HTML. =item url The index of the column containing the URL. Note that the URL must have a recognizable C<http:> at the beginning. =back =cut sub ApplyURL { # Get the parameters. my ($table, $target, $url) = @_; # Loop through the table. for my $row (@{$table}) { # Apply the URL to the target cell. $row->[$target] = CombineURL($row->[$target], $row->[$url]); # Delete the URL from the row. delete $row->[$url]; } } =head3 CombineURL my $combinedHtml = Tracer::CombineURL($text, $url); This method will convert the specified text into HTML hyperlinked to the specified URL. The hyperlinking will only take place if the URL looks legitimate: that is, it is defined and begins with an C<http:> header. =over 4 =item text Text to return. This will be HTML-escaped automatically. =item url A URL to be hyperlinked to the text. If it does not look like a URL, then the text will be returned without any hyperlinking. =item RETURN Returns the original text, HTML-escaped, with the URL hyperlinked to it. If the URL doesn't look right, the HTML-escaped text will be returned without any further modification. =back =cut sub CombineURL { # Get the parameters. my ($text, $url) = @_; # Declare the return variable. my $retVal = CGI::escapeHTML($text); # Verify the URL. if (defined($url) && $url =~ m!http://!i) { # It's good, so we apply it to the text. $retVal = "<a href=\"$url\">$retVal</a>"; } # Return the result. return $retVal; } =head3 Cmp my $cmp = Tracer::Cmp($a, $b); This method performs a universal sort comparison. Each value coming in is separated into a leading text part and a trailing number part. The text part is string compared, and if both parts are equal, then the number parts are compared numerically. A stream of just numbers or a stream of just strings will sort correctly, and a mixed stream will sort with the numbers first. Strings with a label and a number will sort in the expected manner instead of lexically. =over 4 =item a First item to compare. =item b Second item to compare. =item RETURN Returns a negative number if the first item should sort first (is less), a positive number if the first item should sort second (is greater), and a zero if the items are equal. =back =cut sub Cmp { # Get the parameters. my ($a, $b) = @_; # Declare the return value. my $retVal; # Check for nulls. if (! defined($a)) { $retVal = (! defined($b) ? 0 : -1); } elsif (! defined($b)) { $retVal = 1; } else { # Here we have two real values. Parse the two strings. $a =~ /^(\D*)(\d*)$/; my $aParsed = [$1, $2]; $b =~ /^(\D*)(\d*)$/; my $bParsed = [$1, $2]; # Compare the string parts. $retVal = $aParsed->[0] cmp $bParsed->[0]; if (! $retVal) { $retVal = $aParsed->[1] <=> $bParsed->[1]; } } # Return the result. return $retVal; } 1;
MCS Webmaster | ViewVC Help |
Powered by ViewVC 1.0.3 |