Datagen Input Object
The Integrator Datagen input object provides data generation capability within the Integrator. A Datagen object creates a given number of rows containing integer sequences and random data of various types.
Datagen Attributes
| Attribute | Type | Description |
|---|---|---|
| process_type (required) | String | Identifies the object as a Datagen input object. The value of this string is "datagen". |
| count (required) | Integer | Defines the number of rows to generate. |
| seed | Integer | Defines the seed used to initialize the pseudo-random number generator. If non-zero, the random data is generated from this seed, so the same set of numbers can be regenerated on later runs. If 0 or not set, the seed is based on the time the Data Integrator was run, so each script execution generates a new data set. |
| trace_after | Sub-object | Traces data flows leaving the specified object. This is equivalent to adding a Trace process object immediately after the current object. See Embedded Trace Object for more on using trace sub-objects. |
| gen_columns (required) | Array of Objects | Describes input columns for the generated data. Each sub-object can have a variety of attributes, as described in the Datagen Sub-Object Attributes table. |
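
The seed rule above can be illustrated with a minimal Python sketch. This is not the Integrator's implementation: Python's `random` module stands in for the Integrator's pseudo-random number generator, which this document does not describe, and the names `make_rng` and `generate_rows` are hypothetical.

```python
import random
import time


def make_rng(seed=0):
    """Build a PRNG following the documented seed rule (illustrative only)."""
    if seed:
        # Non-zero seed: the same seed always yields the same data.
        return random.Random(seed)
    # 0 or not set: seed from the current time, so each run differs.
    return random.Random(time.time_ns())


def generate_rows(count, seed=0):
    """Generate `count` random integers (a stand-in for one generated column)."""
    rng = make_rng(seed)
    return [rng.randint(0, 999) for _ in range(count)]


# Two runs with the same non-zero seed produce identical rows.
first_run = generate_rows(5, seed=42)
second_run = generate_rows(5, seed=42)
print(first_run == second_run)
```

With a seed of 0 or no seed at all, repeated calls would normally produce different rows, which matches the documented behavior of generating a new data set on each script execution.
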
Datagen Sub-Object Attributes
| Attribute | Type | Description |
|---|---|---|
| name | String | Defines the input column name. |
| type | String | Specifies the generating type for the column. It controls the contents and the randomness of the data. The type attribute can have the following values: |
| distribution | String | Controls the distribution of random values, and is valid for all generating types except sequence. Values include: |
| strings | Array of Strings | Contains an array of string values to be used by the strings generating type (see type attribute above). |
| start | Integer for sequence, String for date | If the generating type is "sequence", this is the start of the sequence, with a default of 1. If the generating type is "date", this is the start of the date range, with no default. Use the standard DI date format. |
| end | Integer for sequence, String for date | If the generating type is "sequence", this is the end of the sequence, with a default of the count of rows being generated. If the generating type is "date", this is the end of the date range, with a default of the current date when the Data Integrator is started. Use the standard DI date format. |
| max | Integer or Decimal | Indicates the highest possible value that can be generated. |
| min | Integer or Decimal | Indicates the lowest possible value that can be generated. |
| length | Integer | Determines the maximum length of a word. It defaults to 8. |
| min_length | Integer | If the generating type is "word", defines the minimum length of the word. It defaults to the value of length. |
| capitalize | Boolean | If this attribute is "true" and the generating type is "word", the generated word is capitalized. By default, the word is not capitalized. |
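
The defaults described in the table above can be sketched in Python as follows. This is an illustration under stated assumptions, not the Integrator's own logic: the Datagen object is modeled as a plain dictionary using the attribute names from the tables, the actual script syntax is not shown in this document, and the helper `apply_column_defaults` is hypothetical. Only generating types named in this section ("sequence", "word", "strings") are used.

```python
import json


def apply_column_defaults(column, count):
    """Return a copy of a gen_columns entry with the documented defaults filled in."""
    resolved = dict(column)
    if resolved.get("type") == "sequence":
        resolved.setdefault("start", 1)      # sequence starts at 1 by default
        resolved.setdefault("end", count)    # and ends at the row count
    elif resolved.get("type") == "word":
        resolved.setdefault("length", 8)     # maximum word length
        resolved.setdefault("min_length", resolved["length"])
        resolved.setdefault("capitalize", False)
    return resolved


# Hypothetical Datagen object modeled as a dictionary (layout is an assumption).
datagen = {
    "process_type": "datagen",
    "count": 100,
    "seed": 12345,
    "gen_columns": [
        {"name": "id", "type": "sequence"},
        {"name": "code", "type": "word", "min_length": 3},
        {"name": "color", "type": "strings", "strings": ["red", "green", "blue"]},
    ],
}

resolved_columns = [
    apply_column_defaults(col, datagen["count"]) for col in datagen["gen_columns"]
]
print(json.dumps(resolved_columns, indent=2))
```

In this sketch the `id` column resolves to a sequence from 1 to 100, and the `code` column resolves to words of 3 to 8 characters that are not capitalized. The `strings` column is left unchanged because the table lists no defaults for it.
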