Difference between revisions of "Data Structures"

From Kolmafia
Jump to navigation Jump to search
imported>Dj d
imported>Heeheehee
(Added record info.)
Line 1: Line 1:
= maps =
+
= Maps =
 
Most of this information was copied directly from ASH Maps Tutorial, by Veracity  
 
Most of this information was copied directly from ASH Maps Tutorial, by Veracity  
 
(http://kolmafia.sourceforge.net/advanced.html#maps)
 
(http://kolmafia.sourceforge.net/advanced.html#maps)
Line 9: Line 9:
 
You can fetch data from a map any time you can provide a data value: in an expression, as a function parameter, on the right side of an assignment statement, from a "return" statement, as so on. You can pass around entire maps, individual elements, or intermediate maps: "slices".
 
You can fetch data from a map any time you can provide a data value: in an expression, as a function parameter, on the right side of an assignment statement, from a "return" statement, as so on. You can pass around entire maps, individual elements, or intermediate maps: "slices".
  
== assignments ==
+
== Assignments ==
  
 
If you use a map on the left side of an assignment, you set the whole map at once to the new value. If you specify a map and a complete set of indices (of the correct types) on the left side of an assignment statement, you set a single element. If you specify a map and a prefix of indices (of the correct type), you directly set one of the intermediate maps, a "slice".
 
If you use a map on the left side of an assignment, you set the whole map at once to the new value. If you specify a map and a complete set of indices (of the correct types) on the left side of an assignment statement, you set a single element. If you specify a map and a prefix of indices (of the correct type), you directly set one of the intermediate maps, a "slice".
  
== declarations ==
+
== Declarations ==
  
 
The syntax for declaring the data type of a map:
 
The syntax for declaring the data type of a map:
Line 19: Line 19:
 
     <data type> [ <key type>, ... ]   
 
     <data type> [ <key type>, ... ]   
  
== references ==
+
== References ==
  
 
The syntax for referencing an element (or slice) of a map:
 
The syntax for referencing an element (or slice) of a map:
Line 49: Line 49:
 
     animal[ "fun" ] => false   
 
     animal[ "fun" ] => false   
  
== contains ==
+
== Contains ==
  
 
You can test the presence of a key in a map using the "contains" operator:
 
You can test the presence of a key in a map using the "contains" operator:
Line 63: Line 63:
 
     animal contains "favorite food" => false   
 
     animal contains "favorite food" => false   
  
== remove ==
+
== Remove ==
  
 
You can remove a key-value association from a map using the "remove" unary operator:
 
You can remove a key-value association from a map using the "remove" unary operator:
Line 101: Line 101:
 
   0 aggregate int [string]
 
   0 aggregate int [string]
  
== deletion ==
+
== Deletion ==
  
 
clear (theMap) will remove all entries
 
clear (theMap) will remove all entries
  
  
== sort ==
+
== Sort ==
  
 
From http://kolmafia.us/showthread.php?t=1738
 
From http://kolmafia.us/showthread.php?t=1738
Line 125: Line 125:
 
* "by random(1000000)" shuffles into a random order.
 
* "by random(1000000)" shuffles into a random order.
 
* "by otherArray[index]" uses values from a parallel array as the sort keys (you'd then need to do "sort otherArray by value;" if you wanted the two arrays to remain in sync).
 
* "by otherArray[index]" uses values from a parallel array as the sort keys (you'd then need to do "sort otherArray by value;" if you wanted the two arrays to remain in sync).
 +
 +
== Records ==
 +
 +
(copy-pasted from Veracity's post introducing the record [http://kolmafia.us/showthread.php?t=280])
 +
 +
Starting with SVN revision 1311 of KoLmafia, ASH now supports a new kind of structured data: the record. Here is a little example of how you declare a record and variables of the new type you've created by doing so.
 +
 +
  record my_type {
 +
  int ifield;
 +
  string sfield;
 +
  record {
 +
  int first;
 +
  int second;
 +
  } rfield;
 +
  int [int, int] mfield;
 +
  };
 +
 
 +
  my_type rvar;
 +
  my_type [int] mrvar;
 +
 +
What I've done with the above is declare a new data type which I've named "my_type". Having declared the new type, I can use it (almost) anywhere that I can use a built-in type name. I declared a variable, "rvar", of that type, and I defined a map, "mrvar", which maps keys of type integer to values of type my_type.
 +
 +
The new type, "my_type" is a "composite" type. It contains four fields. "ifield" is an integer. "sfield" is a string. "rfield" is another composite field: an anonymous record containing two integers named "first" and "second". Finally, "mfield" is a map from [int, int] to int.
 +
 +
As you can see, a record can combine data of all the types ASH supports: primitive, aggregate, and composite.
 +
 +
Having defined the new data type and several variables using it, here are some examples of how to access the fields.
 +
 +
  rvar.ifield = 10;
 +
  rvar.sfield = "secret";
 +
  rvar.rfield.first = 1000;
 +
  rvar.sfield.second = 2000;
 +
  rvar.mfield[ 2, 3 ] = 12;
 +
 
 +
  mrvar[ 1 ] = rvar;
 +
 
 +
  foreach key in mrvar
 +
  foreach key1, key2 in mrvar[key].mfield
 +
  print( "val = " + mrvar[key].mfield[key1,key2] );
 +
 +
As you can see, if you have a variable that is a record, you access the fields of the record by following the variable name with ".&lt;field name&gt;". The resulting value will be of whatever type you declared in the definition of the record. If the value is a map, you can give a list of keys within [], just like any other map. If the value is another record, you can access the fields of the nested record by using another ".&lt;field name&gt;".
 +
 +
If you are familiar with Pascal "records" or C/C++ "structs", this should all be comfortably familiar.
 +
 +
Finally, if you create a map whose values is a record, the file_to_map and map_to_file built-in ASH functions will Do The Right Thing; they will efficiently and reliably save and restore your data.

Revision as of 05:49, 27 February 2010

Maps

Most of this information was copied directly from ASH Maps Tutorial, by Veracity (http://kolmafia.sourceforge.net/advanced.html#maps)

A map is indexed by one data type (the key) and associates that key with another (or the same) data type (the value). The key can be any ASH simple data type: boolean, int, float, string, item, zodiac, location, class, stat, skill, effect, familiar, slot, or monster. The value can be any of those or can be an aggregate: another map. This effectively allows multi-dimensional maps and. In fact, that's how the syntax we provide for multi-dimensional aggregates actually operates: maps of maps of maps ...

You can declare a map any time you can declare a variable: as a top level (global) variable, as a function parameter, or as a local variable in any scope.

You can fetch data from a map any time you can provide a data value: in an expression, as a function parameter, on the right side of an assignment statement, from a "return" statement, as so on. You can pass around entire maps, individual elements, or intermediate maps: "slices".

Assignments

If you use a map on the left side of an assignment, you set the whole map at once to the new value. If you specify a map and a complete set of indices (of the correct types) on the left side of an assignment statement, you set a single element. If you specify a map and a prefix of indices (of the correct type), you directly set one of the intermediate maps, a "slice".

Declarations

The syntax for declaring the data type of a map:

    [ <key type>, ... ]  

References

The syntax for referencing an element (or slice) of a map:

   <variablename>[ <key expression>, ... ]  

All the key expressions will be evaluated at run time. If you specify all the keys the map expects, you fetch data of the type specified by the map. If you specify fewer keys than the map expects, you get an intermediate map, a "slice".

As an example:

   boolean [string, string] props;  

might be used to hold "properties" associated with names.

   props[ "dog", "mammal" ] = true; 
   props[ "dog", "pet" ] = true; 
   props[ "dog", "fun" ] = false;
   props[ "turtle", "mammal" ] = false;
   props[ "turtle", "pet" ] = true;
   props[ "turtle", "fun" ] = false;
   props[ "aardvark", "mammal" ] = true;
   props[ "aardvark", "pet" ] = false;
   props[ "aardvark", "fun" ] = true;  

references:

   props[ "dog", "mammal"] => true 
   boolean [string] animal = props[ "turtle" ]; 
   animal[ "fun" ] => false  

Contains

You can test the presence of a key in a map using the "contains" operator:

   <aggregate reference expression> contains <key expression>  

Where <aggregate reference expression> must evaluate at run time to a map or slice, and must evaluate at run time to a key of the appropriate type. (Note that that is enforced at parse time; ASH can tell the datatype any expression will produce).

   props contains "dog" => true 
   props contains "elephant" => false 
   props[ "aardvark" ] contains "fun" => true 
   animal contains "pet" => true 
   animal contains "favorite food" => false  

Remove

You can remove a key-value association from a map using the "remove" unary operator:

   remove <aggregate reference>  

For clarification, an aggregate reference is "<map name>[ <index 1> ... <index n> ]" where <map name>[ <index 1> ... <index n-1> ] specifies the "slice" and <index n> specifies the "key". Which is just what you expect, if you fully specify the indices; for a single dimensional map, "map[10]" -> "map" is the slice and 10 is the key. The "remove" operator removes the "key" from the "slice". For example:

 string [int] map1;
 map1[5] = "foo";
 print( count( map1 ) + " " + map1 contains 5 + " " + map1[5] );
 print( "remove: " + remove map1[5] );
 print( count( map1 ) + " " + map1 contains 5 + " "  + map1[5] );
 print( "remove: " + remove map1[5] );
 int [string, string] map2;
 map2["me","you"] = 17;
 print( count( map2["me"] ) + " " + map2["me"] contains "you" + " " + map2["me","you"] );
 print( "remove: " + remove map2["me", "you"] );
 print( count( map2["me"] ) + " " + map2["me"] contains "you" + " " + map2["me","you"] );
 print( "remove: " + remove map2["me", "you"] );
 print( count( map2 ) + " " + map2["me"] );
 print( "remove: " + remove map2["me"] );
 print( count( map2 ) + " " + map2["me"] );

yields:

 1 true foo
 remove: foo
 0 false
 remove:
 1 true 17
 remove: 17
 0 false 0
 remove: 0
 1 aggregate int [string]
 remove: aggregate int [string]
 0 aggregate int [string]

Deletion

clear (theMap) will remove all entries


Sort

From http://kolmafia.us/showthread.php?t=1738

The syntax is: sort aggregate by keyExpr; aggregate is a reference to the object to be sorted - arrays are probably the most useful things to sort, but any mapping type can be used (even multidimensional maps, but note that you can only sort along a single dimension at a time). The reference must not be enclosed in parentheses, as that would look like a call to a function named sort() - which is still perfectly valid, "sort" has not become a reserved word.

keyExpr is an arbitrary expression that defines how the items should be ordered. It is evaluated once for every entry in the aggregate, in a scope with two additional variables implicitly defined: 'index' and 'value', holding the details of that entry. The value of the keyExpr is used as the sort key; typically it would be an int or string, but can be any ASH type that can be compared via "<" and the other relational operators.

The most basic form of sorting would therefore be "sort ... by value", but many useful things can be done with the use of a more complex keyExpr - the only real restriction is that the expression should not modify the object you're sorting. For example, if you had an array of items, you could sort it "by autosell_price(value)". An array of weapon items could be sorted "by -get_power(value)" to put it in decreasing order of power. If the elements of your aggregate are records, you'd need to use something like "by value.fieldName", since the records themselves can't be meaningfully compared.

After the sort statement, the aggregate will have exactly the same sets of keys and values as before (even if the keys weren't consecutive), and the iteration order of the keys will be the same, but the values will likely be associated with different keys. The sort is stable - in other words, elements with sort keys that compare as equal will remain in the same order. This means that you can sort on multiple criteria by simply performing separate sorts for each of the criteria, in increasing order of significance.

A few more examples of things you can do:

  • "by -value" sorts integers in decreasing order (there's no similar trick for string values).
  • "by -index" reverses the existing order of an array (or map with integer keys).
  • "by random(1000000)" shuffles into a random order.
  • "by otherArray[index]" uses values from a parallel array as the sort keys (you'd then need to do "sort otherArray by value;" if you wanted the two arrays to remain in sync).

Records

(copy-pasted from Veracity's post introducing the record [1])

Starting with SVN revision 1311 of KoLmafia, ASH now supports a new kind of structured data: the record. Here is a little example of how you declare a record and variables of the new type you've created by doing so.

 record my_type {
 	int ifield;
 	string sfield;
 	record {
 		int first;
 		int second;
 	} rfield;
 	int [int, int] mfield;
 };
 
 my_type rvar;
 my_type [int] mrvar;

What I've done with the above is declare a new data type which I've named "my_type". Having declared the new type, I can use it (almost) anywhere that I can use a built-in type name. I declared a variable, "rvar", of that type, and I defined a map, "mrvar", which maps keys of type integer to values of type my_type.

The new type, "my_type" is a "composite" type. It contains four fields. "ifield" is an integer. "sfield" is a string. "rfield" is another composite field: an anonymous record containing two integers named "first" and "second". Finally, "mfield" is a map from [int, int] to int.

As you can see, a record can combine data of all the types ASH supports: primitive, aggregate, and composite.

Having defined the new data type and several variables using it, here are some examples of how to access the fields.

 rvar.ifield = 10;
 rvar.sfield = "secret";
 rvar.rfield.first = 1000;
 rvar.sfield.second = 2000;
 rvar.mfield[ 2, 3 ] = 12;
 
 mrvar[ 1 ] = rvar;
 
 foreach key in mrvar
 	foreach key1, key2 in mrvar[key].mfield
 		print( "val = " + mrvar[key].mfield[key1,key2] );

As you can see, if you have a variable that is a record, you access the fields of the record by following the variable name with ".<field name>". The resulting value will be of whatever type you declared in the definition of the record. If the value is a map, you can give a list of keys within [], just like any other map. If the value is another record, you can access the fields of the nested record by using another ".<field name>".

If you are familiar with Pascal "records" or C/C++ "structs", this should all be comfortably familiar.

Finally, if you create a map whose values is a record, the file_to_map and map_to_file built-in ASH functions will Do The Right Thing; they will efficiently and reliably save and restore your data.