The program runs on any platform supporting Java2, e.g. Windows (XP,2000,98,95), Linux, Irix 6.3, Tru64 Unix, Solaris, Mac (OS X), etc.
The program package consists of three parts:
An overview of the flow of information is shown in the Figure below.

First you need to install the Java Runtime Environment (JRE) on your computer – it is available free of charge for all the platforms I know of. If only the Software Development Kit (SDK) is available – no problem – the JRE is included in the SDK.
On the various Unix platforms that might require a super user password. Ask your local system administrator for help.
On Windows, you can install the JRE without administrator rights.
For Windows, Linux and Solaris, the JRE is available at: http://java.sun.com/j2se/downloads.html (click on the J2SE 1.4.0 or later link).
For Tru64 Unix (Compaq), the JRE is available at: http://www.compaq.com/java/download/index.html
For Irix (Silicon Graphics Unix), the JRE is available at: http://www.sgi.com/developers/devtools/languages/java.html
Download the indonesia.jar file here (download in
Netscape – click with the right mouse button and choose Save
link as – download in Internet Explorer – right click on the link and choose
Save target as). NOTE! If you are using MAC OS 10.2 or an older version of the Java Runtime,
you may want to install this version of the indonesia.jar
file instead.
Put the file in a folder called something like: c:\indonesia
on Windows or /usr/local/
Create a small script (batch file) that will launch the program.
On Windows call the file indonesia.bat:
java –jar c:\indonesia\indonesia.jar
On Unix call the file indonesia.sh:
#!/bin/tcsh
java –jar
/usr/local/Indonesia/indonesia.jar
On Unix you should furthermore make the file executable with: chmod a+x indonesia.sh.
When you start
By clicking on “Open a sequence file” a window pops up and you can choose one or more (ctrl+left mouse click selects the next file etc.).
When you have selected the file(s) (in all formats except PDB files that will be treated separately) the sequences are read and the individual sequences are shown in a window.
You may select and deselect the sequences that you actually want to align. You may also repeat the procedure and open more sequence files to add sequences to your list. Or you can start all over by clicking ‘delete sequence list’.
You can also add new sequences in the text window: Delete the text “Replace this text with your sequences” and write your sequences in the window (mouse right click gives you access to paste the contents of the clipboard from e.g. Word into the text window). Separate sequences with an asterix, *. Click “add the sequence(s) in the textbox to the list”.
In the
The last two checkboxes in the
Finally you may click the “align sequences”
to align the sequences from scratch. If you have loaded a file with pre-aligned
sequences and you want to use
The mouse actions in the AEW are summarized
in the table below. On a two button mouse, the middle mouse button is usually
emulated by pressing both left and right mouse button. One button mouse – hmm?
|
Left-click in alignment |
Get information on a residue. |
|
Right-click in alignment |
Lock column. |
|
Middle-click in alignment |
Create all-gap column. |
|
Left-click on sequence name. |
Select sequence name (for moving or deleting sequences – with the arrow keys and the delete key) – several sequence names can be selected simultaneously. |
|
Drag in alignment (any mouse button pressed) |
Drags sequence (any sequences selected will also move). |
|
Drag in the area above the alignment (left button) |
Set helix/strand annotation to alpha-helix. |
|
Drag in the area above the alignment (right button) |
Set helix/strand annotation to beta-strand. |
|
Drag in the area above the alignment
(middle button) |
Delete annotation. |
When the sequences have been aligned (or just a display of pre-aligned sequences), the alignment editor window appears.
In the AEW, you can save the alignment in sequence formats FASTA, PIR and CLUSTAL or in PostScript format (menu File->save or save as).
By clicking with the left mouse button on any residue in the alignment, you will get the sequence it belongs to and the position in the sequence.
You can copy the alignment to the clipboard in various forms (full alignment, just sequences etc.) and paste it into e.g. a Word document or any other program (menu Edit).
You can enter the free edit mode (menu Edit->Edit Alignment) – and get out of the free edit mode the by selecting the menu item again.
You can change the order of the sequences or delete one or more sequences. By clicking with the left mouse button on the sequence name, the sequence is selected (this can be repeated). If you press the up or down arrow keys, the sequence(s) with move up or down. If you press the delete key, the sequence(s) will be deleted. Be careful – there is NO undo!
An important feature is the ability to drag the sequences with the left mouse button in the AEW. Two different drag/push modes exist: one where the amino acid residues behave like pearls on a string and one where the sequence behave like a rigid rod so that inserting a gap will move the whole sequence. You can toggle between the two drag/push modes in the menu item (Edit->Sequences move like pearls on a string).
You can lock any of the columns in the alignment by clicking with the right mouse button in the column. A small lock appears under the column.
When you have edited the alignment to correct for errors and maybe locked the columns you know should be aligned (like a protein family fingerprint), you can re-iterate the alignment in the menu (Alignment->Iterate!). After a while, the alignment appears in a new AEW (there is a number on each AEW so that you can see which is which).
In the AEW, you can LOAD new sequences to the existing alignment (File->Load sequences). After the sequences have been loaded a new AEW appears. The new sequences are not aligned – so press (Alignment->Iterate!) to align the new sequences to the existing alignment.
In the alignment menu item, you can also calculate the identity and similarity matrix (see description above) (menu Alignment->Calc. Seq. similarity).
If you have edited the alignment some columns with just gaps might have been introduced, they can be removed (menu Alignment->Remove gap columns).
For almost all of the annotation, you can choose to show it when you save the alignment in PostScript format - except the entropy display in each column.
In the AEW, you have many different options to annotate your alignment. To calculate the consensus sequence, you input the threshold for a column to be in consensus (between 0.51 and 1.0 – i.e. 51% and 100%). In the menu select (Annotation->Calculate consensus). The consensus sequence is shown below the alignment.
You also have the option to enter a line with your own comments – like signs, letters or numbers that you might want to refer to in the text in a paper. In the menu select (Annotation->Edit own comment) to edit the line and select it again to leave it as it is. The delete and backspace keys on the keyboard works.
You can have up to two different numbering schemes for the alignment. And each of them can be numbered just following the columns in the alignment or follow the amino acid residue numbering in any of the sequences. The two numbering schemes can be toggled on/off and will be displayed above and below the alignment, respectively.
To emphasize regions in the alignment, you can add annotation boxes, i.e. rectangular boxes in a given color and with a given line style. Select Annotation->Add annotation box to add a new annotation box, the numbers are the numbers of the sequences in the alignment and the positions are the positions in the alignment. To edit the annotation boxes for any property (position, color or line style) select Annotation->edit annotation boxes in the menu. Here you can also choose to delete any of the annotation boxes.
The consensus sequence is not the only way to get an overview of the regions in the alignment with high conservation. The entropy graph, showing the entropy in each column in the alignment, is also a useful tool. You can toggle in it on/off in the menu item Annotation->Entropy graph. The total entropy for the alignment is a sum over the entropy in each column – Annotation->Calc total entropy.
Another means of annotation is to vary the color scheme. In the menu item color scheme, you will find many different pre-defined color schemes and with the possibility to re-define any of them to enhance the features you want. You can also choose between coloring boxes with the amino acid letters in black or color the letters or toggle coloring off.
You can select a different font size for the alignment - Menu Font size. And toggle between a normal font and bold.
A last useful annotation option is to assign secondary structure elements to the alignment. You can do it manually (later I will show ways to get it done automatically if one or more protein structures are available). Just go to the area above the alignment and numbering screen and press the left mouse button (alpha helix) or right mouse button (beta strand) and drag the mouse button. You can delete the secondary structure annotation at a position by clicking the middle mouse button. The minimum length for an element is set to 3 (alpha helices) and 2 (beta strand).
Many printers have direct capability to print PostScript files. If your printer does not support PostScript, you can e.g. use GhostView to print PostScript through various drivers. GhostView is also a very nice PostScript viewer. GhostView is also available for Windows not just Unix, e.g. at http://www.cs.wisc.edu/~ghost/gsview/index.htm - you need to install GhostScript as well. You can also use GhostView to convert the PostScript file to e.g. jpg, tiff or png format to include the figure in e.g. MS Word.
When you have chosen a file name a dialog box appears. You can choose which of the annotations you want to display (only those that you have active will appear), and whether to place them above or below the alignment. The sequences and the sequence names are separated by a couple of characters. The spacing might be too large – so in the box separation between sequence names and sequences set a negative number as the separation.
A sequence logo is a concentrated
presentation of the information a multiple alignment (T.D. Schneider and R. M.
Stephens (1990). "Sequence logos: A New Way to Display
Consensus Sequences", Nucl. Ac. Res., 18: 6097-6100). To save a
sequence logo, choose “Sequence Logo” as the file type in the save file menu. When
you have chosen a file name a dialog box appears. Here you can choose paper
size, margins etc. The sequence logo is a small part of the alignment, usually
5-20 positions. You can choose the mid-point of the sequence logo in “zero-point”
to an absolute position in the sequence alignment (e.g. 50) and choose the
width from the zero point to the left and right, respectively. The sequence
logo is saved in PostScript format so you can view it with e.g. GhostView (as
mentioned in “Saving to PostScript”).
If you have a protein structure with known
3D coordinates and would like a suggestion for a 3D model of
a protein sequences that is homologous, you want to perform comparative
modeling.
Start at the
All the other features in the AEW described above are also available. Now you can save the alignment in e.g. FASTA format to use in MODELLER, or any other homology modeling program. This example was rather simple, but it works in more complicated cases too.
If you have more than one protein
structure, it would be advantageous to align the 3D coordinates first and use
this structural alignment to obtain a structure-based sequence alignment
(SBSA). By pressing the other tab called structural alignment in the
You can open any number of files in PDB format. Each chain in each structure is treated as an individual structure. Please note that the alignment is carried out on the C-alpha coordinates! A structure list appears similar to the sequence list for the sequence alignment part.
Two structural alignment methods are available, one by Levitt and Gerstein REFERENCE (fast) and a brute force (slow). The checkbox view aligned structures in Madura indicates whether you would like to see the result in the built-in protein structure viewer. Madura is actually also a separate application that has been coupled to all entries in PDB with published structure factors. More information at the Uppsala University Electron Density Service http://portray.bmc.uu.se/eds. You can choose between aligning the selected protein structure chains or just to display them (in Madura). If you press align structures, the following appears:
In Madura you can rotate the protein structure, zoom etc. In the window on the left, you get some alignment statistics. The P(z>Z) should be as close to zero as possible. If the value is 1.00, the structural alignment failed and you might want to play around with the various structural alignment methods and parameters.
The lower half of the
In the AEW, the alignment appears with secondary structure element annotation and ‘=’ signs under the structurally aligned regions. The secondary structure annotation is only effective inside the aligned regions because it may vary from structure to structure outside the aligned regions. Now you may add the sequence(s) that you would like proposed 3D structures for (and press Annotation->Iterate!).
As for the single structure, you may save the result for use in MODELLER, or any other homology modeling program.
Patrik.
Any comments, requests for features, bug reports etc. are more than welcome.
Dennis Madsen, Patrik Johansson, and Gerard J. Kleywegt.
Contact dennis@xray.bmc.uu.se Patrik@xray.bmc.uu.se gerard@xray.bmc.uu.se
Citation:
D. Madsen, P. Johansson, and G. J. Kleywegt
(2002)