I initially have the following dataset:

value;date
100;2021-01-01
160;2021-02-01
250;2021-02-15
10;2021-03-01
90;2021-04-01
150;2021-04-15
350;2021-06-01
20;2021-07-01
100;2021-08-01
10;2021-08-10

Whenever the value in the "value" column drops (e.g. from 250 to 10 on 2021-03-01), I want to save the old value as an offset. When the value drops again (e.g. from 350 to 20 on 2021-07-01), I want to add the new ..
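The question is cut off, so the intended result is not fully known. A minimal sketch, assuming the offset is cumulative (each drop adds the value seen just before the drop) and that the adjusted value is simply value plus offset. The function name `running_offsets` is made up for illustration.

```python
def running_offsets(values):
    """Return (value, offset, adjusted) tuples for a sequence of readings.

    Assumption: every time a reading is lower than the previous one,
    the previous reading is added to a running offset.
    """
    offset = 0
    previous = None
    rows = []
    for value in values:
        if previous is not None and value < previous:
            offset += previous  # a drop: remember the value we fell from
        rows.append((value, offset, value + offset))
        previous = value
    return rows


values = [100, 160, 250, 10, 90, 150, 350, 20, 100, 10]
for value, offset, adjusted in running_offsets(values):
    print(value, offset, adjusted)
```

With the data from the question, the offset becomes 250 on 2021-03-01, 600 on 2021-07-01, and 700 on 2021-08-10 (the third drop, from 100 to 10).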
I have several labelled text datasets, representing different events (with different topics). Taking one event's data as the test set each time, I have noticed that some training sets (events) generalise much better than others (even if they are smaller), regardless of the model used (BERT, SVM, etc.). I was wondering if there ..
This is my dataset now. How do I code it into this dataset?
I am new to programming. I have downloaded the dataset using this command:

wget http://opus.lingfil.uu.se/download.php?f=OpenSubtitles/en.tar.gz

and now I have to unpack the dataset using these commands:

#Making assumption that user hasn’t put any other tar files in a folder
#Two tarball extractions because during testing it downloaded as .gz once?
tar -xvf *.tar
tar -xvf ..
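Since the download may arrive as either a plain or a gzip-compressed tarball, one option is to let Python's standard `tarfile` module detect the compression. This is only a sketch: the helper name `extract_archives` and the folder layout are assumptions, and archives should only be extracted if they come from a trusted source.

```python
import tarfile
from pathlib import Path


def extract_archives(folder, dest=None):
    """Extract every tar archive found directly inside `folder`.

    Mode "r:*" lets tarfile detect gzip/bz2/xz compression on its own,
    so .tar and .tar.gz files are handled the same way.
    Returns the names of the archives that were extracted.
    """
    folder = Path(folder)
    dest = Path(dest) if dest is not None else folder
    extracted = []
    # Materialise the listing first so newly extracted files are not visited.
    for path in sorted(folder.iterdir()):
        if path.is_file() and tarfile.is_tarfile(str(path)):
            with tarfile.open(str(path), "r:*") as archive:
                archive.extractall(str(dest))
            extracted.append(path.name)
    return extracted
```

Calling `extract_archives(".")` in the download folder would unpack every tarball there, whatever extension it was saved with.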
I have two datasets with the following shapes: X = 3114 x 627 and y = 3114 x 1 (species). Every 9 rows (spectra) of X correspond to 1 strain, so there are 3114/9 = 346 strains. The goal is to predict the species of each strain. I was wondering what is the best way ..
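The question is truncated, so what follows is a guess at the concern: when 9 spectra belong to one strain, a train/test split usually should keep all 9 on the same side. A minimal sketch in plain Python, assuming rows are ordered so that rows 0-8 are strain 0, rows 9-17 are strain 1, and so on. The function names and the 20% test fraction are illustrative only.

```python
import random

ROWS = 3114
SPECTRA_PER_STRAIN = 9


def strain_ids(n_rows=ROWS, per_strain=SPECTRA_PER_STRAIN):
    """Assign a strain index to every row, assuming consecutive grouping."""
    return [row // per_strain for row in range(n_rows)]


def split_by_strain(ids, test_fraction=0.2, seed=0):
    """Split row indices so that no strain appears in both train and test."""
    strains = sorted(set(ids))
    rng = random.Random(seed)
    rng.shuffle(strains)
    n_test = max(1, round(len(strains) * test_fraction))
    test_strains = set(strains[:n_test])
    train_rows = [i for i, s in enumerate(ids) if s not in test_strains]
    test_rows = [i for i, s in enumerate(ids) if s in test_strains]
    return train_rows, test_rows


ids = strain_ids()
train_rows, test_rows = split_by_strain(ids)
print(len(set(ids)), len(train_rows), len(test_rows))
```

The same `ids` list could be passed as the `groups` argument of a group-aware splitter such as scikit-learn's `GroupKFold`, if that library is in use.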
I originally asked this on the Data Science Stack Exchange but was told to post it here. I am wondering what the best practice is for importing a large amount of data (using Python) that requires querying a server. For context, consider the following stock data setting. We have a list of ticker symbols, ..
Good afternoon. Unfortunately, I could not find an answer to a simple question. I have a folder of documents in PDF format. I can use pandas to open one document and add its text to an array, where the first column is the folder name and the second is the text from the document. But how do ..
I want to change the labels of my data from one-hot to multi-hot. In other words, I’d like to create a target encoding. For example, in the PyTorch MNIST dataset (image, label):

([[..],[..],[..]], 6) --> ([[..],[..],[..]], [1 1 0 0 1 0 1 0 0 1])
([[..],[..],[..]], 1) --> ([[..],[..],[..]], [1 0 0 1 1 0 1 ..
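One way to express such a relabelling is a lookup table from class label to multi-hot vector. The sketch below is plain Python and only contains the one code that is fully visible in the question (for label 6); the codes for the other digits are cut off or not shown and would have to be supplied by the asker. The names `CODES` and `to_multi_hot` are hypothetical.

```python
# Lookup table: class label -> multi-hot target vector.
# Only the entry for 6 comes from the question; the rest must be filled in.
CODES = {
    6: [1, 1, 0, 0, 1, 0, 1, 0, 0, 1],
}


def to_multi_hot(label, codes=CODES):
    """Return a copy of the multi-hot vector registered for `label`."""
    if label not in codes:
        raise KeyError(f"no multi-hot code defined for label {label}")
    return list(codes[label])


print(to_multi_hot(6))
```

A function like this can be passed as the `target_transform` argument of `torchvision.datasets.MNIST`, possibly wrapped so that it returns a tensor instead of a list.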
I downloaded a zip file with the extension .dataset.xz. When I unzipped it, I got a .dataset file. The file is supposed to contain noisy images that I need to read one by one. How do I open the contents of this file?
Good morning. For my project, I need to generate random coordinates in the form of latitude and longitude pairs. The generated coordinates need to be inside the borders of the USA, and ideally a coordinate would be more likely to be generated near the biggest cities in the USA. I have a ..
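A rough sketch of one possible approach: pick a city with probability proportional to a weight, then jitter around it with Gaussian noise. Several things here are assumptions: the city list is a small illustrative sample with approximate coordinates, the weights are made-up relative values rather than real population figures, and the bounding box only approximates the contiguous USA (it also covers ocean and parts of Canada and Mexico), so a real border check would need a polygon test against actual boundary data.

```python
import random

# (name, latitude, longitude, illustrative relative weight)
CITIES = [
    ("New York", 40.7128, -74.0060, 8.0),
    ("Los Angeles", 34.0522, -118.2437, 4.0),
    ("Chicago", 41.8781, -87.6298, 3.0),
    ("Houston", 29.7604, -95.3698, 2.0),
]

# Approximate bounding box of the contiguous USA: lat_min, lat_max, lon_min, lon_max.
BOX = (24.5, 49.4, -124.8, -66.9)


def random_coordinate(rng, spread_deg=1.0):
    """Sample a (lat, lon) pair clustered around a weighted random city."""
    lat_min, lat_max, lon_min, lon_max = BOX
    weights = [city[3] for city in CITIES]
    while True:
        _, lat, lon, _ = rng.choices(CITIES, weights=weights, k=1)[0]
        p_lat = rng.gauss(lat, spread_deg)
        p_lon = rng.gauss(lon, spread_deg)
        # Rejection step: retry if the jittered point leaves the rough box.
        if lat_min <= p_lat <= lat_max and lon_min <= p_lon <= lon_max:
            return p_lat, p_lon


rng = random.Random(42)
print([random_coordinate(rng) for _ in range(3)])
```

A larger `spread_deg` scatters points further from the cities; replacing the box test with a point-in-polygon check would enforce the actual border.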