PySpark AttributeError: 'SQLContext' object has no attribute 'cursor' - Solution and Explanation - 常规

This error occurs when you try to use the cursor() method on an object of SQLContext class in PySpark.

SQLContext is a class in PySpark that is used to interact with structured data using SQL-like syntax. It does not have a cursor() method as it is not used to directly connect to databases.

To fix this error, you need to use a different object that has cursor() method. For example, if you want to execute SQL queries on a database, you can use pyspark.sql.DataFrameReader class to read data from a database and then use cursor() method to execute SQL queries on the data.

Here's an example:

from pyspark.sql import SparkSession
spark = SparkSession.builder.appName('myApp').getOrCreate()

df = spark.read.format('jdbc').option('url', 'jdbc:mysql://localhost:3306/mydb').option('dbtable', 'mytable').option('user', 'myuser').option('password', 'mypassword').load()

conn = df.write.format('jdbc').option('url', 'jdbc:mysql://localhost:3306/mydb').option('dbtable', 'mytable2').option('user', 'myuser').option('password', 'mypassword').mode('overwrite').jdbc()

cursor = conn.cursor()
cursor.execute('SELECT * FROM mytable2')
result = cursor.fetchall()

In this example, we use DataFrameReader to read data from a MySQL database and then write the data to another table using write() method. We then use cursor() method on the connection object returned by write() method to execute SQL queries on the data.

PySpark AttributeError: 'SQLContext' object has no attribute 'cursor' - Solution and Explanation