Thanks for the reply! So briefly recapping: my data consists of numbers and I am trying to predict next number(s) based on the previous numbers (LSTM). The observed data looks like this:
0000001
0000002
0000003
0000004
0000005
0000006
0000007
0000008
0000009
0000010
In my data.csv file I have created sequences (manually) by using concatenating three subsequent observed numbers and then using next observed number in the data as the prediction. Each line is repeats the same procedure, which results the following file content:
numbers,predictions
0000001 0000002 0000003,0000004
0000002 0000003 0000004,0000005
0000003 0000004 0000005,0000006
0000004 0000005 0000006,0000007
0000005 0000006 0000007,0000008
0000006 0000007 0000008,0000009
0000007 0000008 0000009,0000010
My model definition mdf.yaml file content:
input_features:
-
name: numbers
type: sequence
encoder: cnnrnn
cell_type: lstm
output_features:
-
name: predictions
type: sequence
decoder: generator
Running the command
ludwig train -mdf mdf.yaml --data_csv data.csv
Results into a error
--- snip ---
raise ValueError("Cannot convert an unknown Dimension to a Tensor: %s" % d)
ValueError: Cannot convert an unknown Dimension to a Tensor: ?
Tried looking examples in the guide, but clearly not understanding enough yet...