General Help

What is StatCrunch?

StatCrunch is a statistical data analysis package for the World Wide Web. It is written in the form of a Java applet. We think users will find it easy to use, and we hope they enjoy working with our package!

Who we are

StatCrunch was created and programmed by a team of programmers and statisticians led byWebster West. Dr. West is in the Department of Statistics at Texas A&M University. The package was created as an initial attempt to solve many of the problems that exist with the delivery and use of modern statistical software. Many times statisticians develop procedures in languages such as Splus, SAS, Minitab, etc.., which are very specific to statisticians. Students and other potential users may not have access to these packages, and therefore may not be able to use the procedures. By using Java and the World Wide Web, StatCrunch should reach the broadest possible audience of any statistical software of its kind.

Getting started

StatCrunch should run on any of the three major platforms (Mac, PC, Unix). It only has the minimal requirement of a Java-capable Web browser which almost everyone on the Web now has. If you do not have a Java-enabled browser, you will probably see a gray box which may or may not have a red x in it after logging in to StatCrunch. A test to determine if a browser is Java-enabled is given below:

Java -

If the test above indicates that Java is not enabled, perform one of the following:

 
Explorer 5.x
 
Explorer 4.x
 
Communicator 4.x
 
Navigator 3.x

Using StatCrunch

The Data, Stat, Graphics and Help menus, located at the top of the StatCrunch frame, provide users with access to the analysis procedures of the software. The Help menu is linked to the StatCrunch help page. See the Data, Stat and Graphics help pages for a listing of these procedures and instructions on how to use them.

The dataset to be analyzed is displayed inside the data table located below the menu bar. StatCrunch offers a variety of methods for loading data. After loading data and selecting a menu item, a listing of the available procedures will appear in a new window. A dialog box will appear after selecting one of these procedures. In the dialog box will be a ? button which directly links the user to the relevant help information for that procedure. After making selections within the dialog box, the results of the procedure will appear in the window.


Saving and Printing Results

To copy, save or print StatCrunch results, you will first need to export the result to HTML. First, select the Export option under the Options menu of the result window. The graphics in the output are written as GIF files on the StatCrunch server, so this may take a few moments for results that contain a large number of graphics. With the latest StatCrunch interface, the results in HTML format will be displayed in the frame below the data table. In older versions of the interface, the results appear in pop up windows (which may be blocked if you have a pop up blocker turned on in your browser). In either case, use the browser's File menu to print your results. In most browsers, you can also copy selected graphics and/or text to the clipboard by choosing the Copy option under the Edit menu of your browser or by right clicking your mouse in the frame containing the HTML results. It is important to remember that the graphics links in the HTML file are to the graphics stored on the StatCrunch server. The file names for graphics are encoded with random letters so that only individuals who have the exact file names will have access to them. Individual graphics may be saved by clicking on the graphic in the new window and then using the browser's File menu to save or print it. If graphics are downloaded to a local file system, the IMG tags in the HTML file must be edited to indicate the proper path to the graphics files on the local system.

Including StatCrunch in a web page

Feel free to link to the StatCrunch site using the following syntax:

<A href="http://www.statcrunch.com/">StatCrunch</A>


Linking Data

Using the Link Generator Form, an HTML link can be created so that a specified data file on the web will be automatically loaded into StatCrunch when the link is clicked. Both text and Excel files can be linked.
  1. Simply specify the text of the link to be displayed on the web page (e.g., "My Data File").
  2. Specify the WWW address of the dataset to be loaded (e.g., http://www.myData.com/myData.txt).
  3. If the first line of the data file contains variable names, check the Use first line as variable names option.
  4. If the data file is a text file, specify the delimiter for the observations. The delimiter options are whitespace (any whitespace character such as a space or tab), tab, comma (for .csv files) and semicolon.
As an example, the Excel data file located at http://www.stat.sc.edu/~west/hotdog.xls can be accessed by clicking the following link: Hotdog Data

Using WHERE

When selecting data to be used with the various analysis procedures, a WHERE statement can be used to determine which rows from the data table will be included in the analysis. The Where statement provides an excellent way to isolate a subgroup within the data for analysis. The statement should be a valid boolean expression which evaluates to either a true or false value. The expression will be evaluated for each row in the data set, and only rows where the expression evaluates to true will be included in the analysis. See the section on expressions below for more information on constructing boolean expressions. Example syntax for Where statements, using the Hotdog Data, are given below.
Calories=190
includes rows where the Calories column is equal to 190
Calories>150
includes rows where the Calories column is greater than 150
Calories>=150
includes rows where the Calories column is greater than or equal to 150
Calories<>190
includes rows where the Calories column is not equal to 190
Calories!=190
includes rows where the Calories column is not equal to 190
LOG(Calories)>5
includes rows where the natural logarithm of the Calories column is greater than 5
Type=Meat
includes rows where the text in the Type column is Meat.
Type="Meat"
includes rows where the text in the Type column is Meat. Note that it is only necessary to use double quotes when the text string contains spaces.
Type<>Beef
includes rows where the text in the Type column is not Beef.
Sodium=386 AND Type=Meat
includes rows where the Sodium column is equal to 386 and the Type column is Meat
Sodium<=400 OR Type="Meat"
includes rows where the Sodium column is less than or equal to 400 or the Type column is Meat
(Sodium>=400 OR Sodium<=500) AND Type="Meat"
includes rows where the Sodium column is between 400 and 500 and the Type column is Meat
row=5
includes only the 5th row
row>=3 AND row<=10
includes rows 3 through 10

Expressions

Some StatCrunch procedures allow the user to input either a boolean (true/false) or mathematical expression. See the compute expression section to see examples of mathematical expressions. See the WHERE section for examples of boolean expressions used to control the data rows that are included in an analysis. Notes on using expressions: