Skip to main content

I am working on airport data and I am new to python. I have a code in STATA and I need help to convert it into python [closed]

I am working on airport data and I am new to python. I have a code in STATA and I need help to convert it into python. The code is as follows:

gen FlightID= origin+ dest+ op_unique_carrier+ string(op_carrier_fl_num)

egen N_Flights = count( FlightID ), by ( FlightID )

 

keep fl_date op_unique_carrier tail_num origin dest crs_dep_time dep_time dep_delay taxi_out wheels_off wheels_on taxi_in crs_arr_time arr_time FlightID N_Flights

drop taxi_in crs_arr_time //crs_dep_time

drop if dep_time==.

drop if wheels_off==.

keep if origin=="EWR"

drop if wheels_off<dep_time //revisit to bring back those flights around midnight

sort fl_date dep_time

gen fl_id=_n

//gen Departed = 1

//replace Departed= Departed[_n-1]+1 if _n>1

sort fl_date wheels_off

//gen Tookoff = 0

//replace Tookoff= Tookoff[_n-1]+1 if _n>1

//gen DepartedQ=Departed-Tookoff

gen date=date(fl_date, "YMD")

format date %td

gen dow=dow( date)+1

gen QuJump=0

replace QuJump=1 if dep_time>dep_time[_n+1] & date==date[_n+1]

gen QuJumpMin=(dep_time-dep_time[_n+1])+(wheels_off[_n+1]-wheels_off)

summarize QuJumpMin if QuJumpMin>0

gen id = _n

gen obs=2

expand obs

drop obs

sort id

gen time_stamp=dep_time

replace time_stamp=wheels_off if id==id[_n-1]

gen stamp="dep_time"

replace stamp="wheels_off" if id==id[_n-1]

sort fl_date time_stamp

gen Qid = _n

gen QLength=1

replace QLength=cond(stamp=="dep_time" & _n>=2,QLength[_n-1]+1,cond(stamp=="wheels_off" & _n>=2,QLength[_n-1]-1,QLength))

gen Tookoff = 0

replace Tookoff=cond(stamp=="wheels_off" & _n>=2,Tookoff[_n-1]+1,cond(stamp=="dep_time" & _n>=2,Tookoff[_n-1],Tookoff))

sort fl_id stamp

gen QJumpSpots=QLength-(Tookoff[_n+1]-Tookoff) if stamp=="dep_time" & fl_id==fl_id[_n+1]


source https://stackoverflow.com/questions/72382902/i-am-working-on-airport-data-and-i-am-new-to-python-i-have-a-code-in-stata-and

Comments

Popular posts from this blog

ValueError: X has 10 features, but LinearRegression is expecting 1 features as input

So, I am trying to predict the model but its throwing error like it has 10 features but it expacts only 1. So I am confused can anyone help me with it? more importantly its not working for me when my friend runs it. It works perfectly fine dose anyone know the reason about it? cv = KFold(n_splits = 10) all_loss = [] for i in range(9): # 1st for loop over polynomial orders poly_order = i X_train = make_polynomial(x, poly_order) loss_at_order = [] # initiate a set to collect loss for CV for train_index, test_index in cv.split(X_train): print('TRAIN:', train_index, 'TEST:', test_index) X_train_cv, X_test_cv = X_train[train_index], X_test[test_index] t_train_cv, t_test_cv = t[train_index], t[test_index] reg.fit(X_train_cv, t_train_cv) loss_at_order.append(np.mean((t_test_cv - reg.predict(X_test_cv))**2)) # collect loss at fold all_loss.append(np.mean(loss_at_order)) # collect loss at order plt.plot(np.log(al...

Sorting large arrays of big numeric stings

I was solving bigSorting() problem from hackerrank: Consider an array of numeric strings where each string is a positive number with anywhere from to digits. Sort the array's elements in non-decreasing, or ascending order of their integer values and return the sorted array. I know it works as follows: def bigSorting(unsorted): return sorted(unsorted, key=int) But I didnt guess this approach earlier. Initially I tried below: def bigSorting(unsorted): int_unsorted = [int(i) for i in unsorted] int_sorted = sorted(int_unsorted) return [str(i) for i in int_sorted] However, for some of the test cases, it was showing time limit exceeded. Why is it so? PS: I dont know exactly what those test cases were as hacker rank does not reveal all test cases. source https://stackoverflow.com/questions/73007397/sorting-large-arrays-of-big-numeric-stings

How to load Javascript with imported modules?

I am trying to import modules from tensorflowjs, and below is my code. test.html <!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8"> <title>Document</title </head> <body> <script src="https://cdn.jsdelivr.net/npm/@tensorflow/tfjs@2.0.0/dist/tf.min.js"></script> <script type="module" src="./test.js"></script> </body> </html> test.js import * as tf from "./node_modules/@tensorflow/tfjs"; import {loadGraphModel} from "./node_modules/@tensorflow/tfjs-converter"; const MODEL_URL = './model.json'; const model = await loadGraphModel(MODEL_URL); const cat = document.getElementById('cat'); model.execute(tf.browser.fromPixels(cat)); Besides, I run the server using python -m http.server in my command prompt(Windows 10), and this is the error prompt in the console log of my browser: Failed to loa...