Skip to main content

Nested SQL Queries using multiple tables and pymssql in Python

I have 2 tables with information about contestant races that I need to organize into one clean output statement (not a tuple or list) that only gives each unique contestant along with their region, average race time, and a count of how many races they competed in.

Using "SELECT TABLE_NAME, COLUMN_NAME FROM INFORMATION_SCHEMA.COLUMNS;" returns the following tables I have to work with: Tables, Column_Names


('Outcome', 'ID')
('Outcome', 'RaceID')
('Outcome', 'RaceTime')
('Outcome', 'ResultID')
('Contestant', 'Age')
('Contestant', 'Region')
('Contestant', 'ID')
('Contestant', 'Name')
('Contestant', 'Gender')

Here is what I have so far. I can get the first part to join the tables and return the individual columns that I need, but they aren't in the right format and it is including each race instead of an aggregate of each contestant.

#This block returns all the columns needed from all tables with INNER JOIN
cursor.execute("SELECT Contestant.Name, Contestant.Region, Outcome.RaceTime, Outcome.RaceID FROM Contestant INNER JOIN Outcome ON Contestant.ID=Outcome.ID;")
for row in cursor:
    print(row[0], row[1], row[2], row[3])

The first couple lines that this Query returns are below but they dont have 3 spaces between each variable, not the full region name, not an average of racetime, and showing the raceID instead of count of races.

Smith PNW 64.59 1.0
Kohl SW 50.3 1.0

If this was working the way I needed, it would return this way with 3 spaces between variables, full region name, and average of all the contestants race times, and a count of how many races they were in. This would also then need to be ordered by average time from highest to lowest.

Smith   Pacific Northwest   59.454   7
Kohl   Southwest   54.203   4

Here are other lines I have that will return separately everything else that I need to put into the clean table:

#The following each does a part of what needs to happen with the columns data ultimately
cursor.execute("SELECT Region, CASE WHEN Region = 'PNW' THEN 'Pacific Northwest' ELSE 'Southwest' END AS Region_Long FROM Contestant;") #Updates abbreviated region names to full names
for row in cursor:
    print(row[1]) #prints out just the full region names
cursor.execute("SELECT ID, AVG(RaceTime) FROM Outcome GROUP BY ID ORDER BY AVG(RaceTime) DESC;") #Calculates each Contestants's avg race time
for row in cursor:
    print(row[1])
cursor.execute("SELECT COUNT(DISTINCT RaceID) FROM Outcome GROUP BY ID;") #Counts how many races each Contestant had
for row in cursor:
    print(row[0])

I just can't seem to figure out how to put all of this together to make it output how I need it to instead of just outputting each part separately.



source https://stackoverflow.com/questions/75123539/nested-sql-queries-using-multiple-tables-and-pymssql-in-python

Comments

Popular posts from this blog

ValueError: X has 10 features, but LinearRegression is expecting 1 features as input

So, I am trying to predict the model but its throwing error like it has 10 features but it expacts only 1. So I am confused can anyone help me with it? more importantly its not working for me when my friend runs it. It works perfectly fine dose anyone know the reason about it? cv = KFold(n_splits = 10) all_loss = [] for i in range(9): # 1st for loop over polynomial orders poly_order = i X_train = make_polynomial(x, poly_order) loss_at_order = [] # initiate a set to collect loss for CV for train_index, test_index in cv.split(X_train): print('TRAIN:', train_index, 'TEST:', test_index) X_train_cv, X_test_cv = X_train[train_index], X_test[test_index] t_train_cv, t_test_cv = t[train_index], t[test_index] reg.fit(X_train_cv, t_train_cv) loss_at_order.append(np.mean((t_test_cv - reg.predict(X_test_cv))**2)) # collect loss at fold all_loss.append(np.mean(loss_at_order)) # collect loss at order plt.plot(np.log(al...

Sorting large arrays of big numeric stings

I was solving bigSorting() problem from hackerrank: Consider an array of numeric strings where each string is a positive number with anywhere from to digits. Sort the array's elements in non-decreasing, or ascending order of their integer values and return the sorted array. I know it works as follows: def bigSorting(unsorted): return sorted(unsorted, key=int) But I didnt guess this approach earlier. Initially I tried below: def bigSorting(unsorted): int_unsorted = [int(i) for i in unsorted] int_sorted = sorted(int_unsorted) return [str(i) for i in int_sorted] However, for some of the test cases, it was showing time limit exceeded. Why is it so? PS: I dont know exactly what those test cases were as hacker rank does not reveal all test cases. source https://stackoverflow.com/questions/73007397/sorting-large-arrays-of-big-numeric-stings

How to load Javascript with imported modules?

I am trying to import modules from tensorflowjs, and below is my code. test.html <!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8"> <title>Document</title </head> <body> <script src="https://cdn.jsdelivr.net/npm/@tensorflow/tfjs@2.0.0/dist/tf.min.js"></script> <script type="module" src="./test.js"></script> </body> </html> test.js import * as tf from "./node_modules/@tensorflow/tfjs"; import {loadGraphModel} from "./node_modules/@tensorflow/tfjs-converter"; const MODEL_URL = './model.json'; const model = await loadGraphModel(MODEL_URL); const cat = document.getElementById('cat'); model.execute(tf.browser.fromPixels(cat)); Besides, I run the server using python -m http.server in my command prompt(Windows 10), and this is the error prompt in the console log of my browser: Failed to loa...