#### Category : data-analysis

Goal: I have a column called Resolution. What you can see in the picture. Let’s say the first cell 1600 x 900 I need to get rid of the x in order to have an integer value. So the results should be 1600 900. How can I do that? I have tried to search online ..

I was trying to add a constant slope in a existing time series graph while values are decreasing. The values should follow some conditions. Can anyone tell me which way I should follow? The conditions are as follows when value goes 105 to 30 it maintains constant slope -16.66. if the values does not reach ..

I can not understand how Matplotlib hexagonal bin (hexbin) plots hexagons from data. How are (x, y) coordinates in hexagons calculated whether alone passed to function or with c parameter and function? Would you please explain how Matplotlib calculates the data displayed as hexagons (equation(s) and example preferred), and how do the c parameter and ..

I am working on a analytical dashboard where I extract data from multiple sources within this organization and analyze them. I managed to extract from one database. The second one there seems to be a connection error. Everything is under the same network but I guess they put up a firewall for this particular one ..

How to assign dynamiclly numbers to each unique value? I’ve searched but I can only see 1 answer: # creating a dict file gender = {‘male’: 1,’female’: 2} # traversing through dataframe # Gender column and writing # values where key matches data.Gender = [gender[item] for item in data.Gender] print(data) But these answer uses fixed ..

I am using Spyder and the following source code: import pandas as pd filename = "file.csv" # 5.35 GB in size df = pd.read_csv(filename, nrows=5) pd.set_option(‘display.max_columns’, None) df Output runfile(‘C:/Users/pc/Desktop/Data Mining/file.py’, wdir=’C:/Users/pc/Desktop/Data Mining/’) Reloaded modules: jupyter_client.session, zmq.eventloop, zmq.eventloop.ioloop, tornado.platform, tornado.platform.asyncio, tornado.gen, zmq.eventloop.zmqstream, jupyter_client.jsonutil, jupyter_client.adapter, spyder, spyder.pil_patch, PIL, PIL._version, PIL.Image, PIL.ImageMode, PIL.TiffTags, PIL._binary, PIL._util, PIL._imaging, cffi, ..

I have a set of emails with extracted array of keywords and with metalabel. I want to use HDBSACN in python to make topic clustering but I cannot find any example whit is corect format of data to use in hdbscan. class Mail(object): id = 1 keywords = [("word1",0.45),("word2",0.36)…] metalabel = "metalabel" hdbscan.HDBSCAN(min_cluster_size=5, metric=’euclidean’, cluster_selection_method=’eom’).fit(???) ..