I have 2 tables with information about contestant races that I need to organize into one clean output statement (not a tuple or list) that only gives each unique contestant along with their region, average race time, and a count of how many races they competed in.
Using "SELECT TABLE_NAME, COLUMN_NAME FROM INFORMATION_SCHEMA.COLUMNS;"
returns the following tables I have to work with: Tables, Column_Names
('Outcome', 'ID')
('Outcome', 'RaceID')
('Outcome', 'RaceTime')
('Outcome', 'ResultID')
('Contestant', 'Age')
('Contestant', 'Region')
('Contestant', 'ID')
('Contestant', 'Name')
('Contestant', 'Gender')
Here is what I have so far. I can get the first part to join the tables and return the individual columns that I need, but they aren't in the right format and it is including each race instead of an aggregate of each contestant.
#This block returns all the columns needed from all tables with INNER JOIN
cursor.execute("SELECT Contestant.Name, Contestant.Region, Outcome.RaceTime, Outcome.RaceID FROM Contestant INNER JOIN Outcome ON Contestant.ID=Outcome.ID;")
for row in cursor:
print(row[0], row[1], row[2], row[3])
The first couple lines that this Query returns are below but they dont have 3 spaces between each variable, not the full region name, not an average of racetime, and showing the raceID instead of count of races.
Smith PNW 64.59 1.0
Kohl SW 50.3 1.0
If this was working the way I needed, it would return this way with 3 spaces between variables, full region name, and average of all the contestants race times, and a count of how many races they were in. This would also then need to be ordered by average time from highest to lowest.
Smith Pacific Northwest 59.454 7
Kohl Southwest 54.203 4
Here are other lines I have that will return separately everything else that I need to put into the clean table:
#The following each does a part of what needs to happen with the columns data ultimately
cursor.execute("SELECT Region, CASE WHEN Region = 'PNW' THEN 'Pacific Northwest' ELSE 'Southwest' END AS Region_Long FROM Contestant;") #Updates abbreviated region names to full names
for row in cursor:
print(row[1]) #prints out just the full region names
cursor.execute("SELECT ID, AVG(RaceTime) FROM Outcome GROUP BY ID ORDER BY AVG(RaceTime) DESC;") #Calculates each Contestants's avg race time
for row in cursor:
print(row[1])
cursor.execute("SELECT COUNT(DISTINCT RaceID) FROM Outcome GROUP BY ID;") #Counts how many races each Contestant had
for row in cursor:
print(row[0])
I just can't seem to figure out how to put all of this together to make it output how I need it to instead of just outputting each part separately.
source https://stackoverflow.com/questions/75123539/nested-sql-queries-using-multiple-tables-and-pymssql-in-python
Comments
Post a Comment