Filter data imported by gspread Python 3.x

Question

Filter data imported by gspread Python 3.x

Navigation

#1 by (1 votes)

4

I'm importing data from a Google Sheets spreadsheet, but I need to filter some rows and not load all of them. Someone runs this library, and knows if there is an argument to filter rows.

I am doing it in the following way so far:

    import gspread
    from oauth2client.service_account import ServiceAccountCredentials

    scope=['https://spreadsheets.google.com/feeds']
    creds=ServiceAccountCredentials.from_json_keyfile_name('Python-                        cbe875b7fd9f.json', scope)
    client=gspread.authorize(creds)
    sheet=client.open('Prueba Python').sheet1
    preventivos=sheet.get_all_records()
    print(preventivos)

This I do since there is a column that has several options for example "NOGO", "MOC" and "AHM" option. I want to import only those rows that have the "AHM" option. In this way the load will be faster because now it takes several seconds and there is a lot of data that I do not care to import.

Example file:

link

python google-sheets

asked by Jorge Ponti 02.08.2017 в 16:21

source

1 answer

Iterative query with permissions Redirect to app play store from html

score 1 · Answer 1

Jorge, the documentation of this library indicates for the routine get_all_records() :

Returns to list of dictionaries, all of them having: the contents of The spreadsheet with the head row as keys, And each of these dictionaries holding - the contents of subsequent rows of cells as values.

That is, a list of dictionaries where each key is the column header. Something like the following, based on your example:

lista = [
          {"B/C": "T00APSUL",   "NOMBRE": "PREVENTIVO", "FECHA": "2016-10-07 00:07:55", "AVION": "CC-BAS", "FLOTA": "A320C", "ATA": "550" , "DESCRIPCION": "NO GO VIBRATION REPORTED BY PIREP", "H1": "COMPLETE", "FUENTE": "NOGO"},
          {"B/C": "T00AQ4HM",   "NOMBRE": "PREVENTIVO", "FECHA": "2016-10-08 10:40:26", "AVION": "CC-BGG", "FLOTA": "B787", "ATA": "45" , 
           "DESCRIPCION": "DSO REQUEST R/R ENGINE DATA ACMF REPORT DOES NOT DOWNLINK", "H1": "COMPLETE", "FUENTE": "AHM"}
        ]

I did not transcribe all the records, we must also see the behavior in front of columns without headers, for example what would be the column H of your spreadsheet. With this data the filter is something like this:

print([e for e in lista if e["FUENTE"] == "AHM"])

That is, we return the rows where in the column "SOURCE" we find the text "AHM".