EasyXLS

How to import data from Excel to DataSet in Python

The EasyXLS Excel library can be used to import Excel files with Python on Windows, Linux, Mac or other operating systems. The integration varies depending on the operating system and on whether the bridge for .NET Framework or the bridge for Java is chosen:

EasyXLS on Windows using .NET Framework with Python

If you opt for the .NET version of EasyXLS, the code below requires Pythonnet, a bridge between Python and .NET Framework.

Step 1: Download and install EasyXLS Excel Library for .NET

To download the trial version of EasyXLS Excel Library, use the button below:

Download EasyXLS™ Excel Library for .NET

If you already own a license key, you may log in and download EasyXLS from your account.

Step 2: Install Pythonnet

To install Pythonnet, run the "pip" command as follows. Pip is a package-management system used to install and manage software packages written in Python.
<Python installation path>\Scripts>pip install "pythonnet.whl"
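Before moving on, it can help to confirm that the bridge is visible to the interpreter you intend to use. The sketch below is only an illustration: the helper name bridge_installed is hypothetical, and it assumes the bridge is importable under the module name used in the tutorial code ("clr" for Pythonnet). It only checks that a module with that name can be found; it does not load the .NET runtime.

```python
import importlib.util


def bridge_installed(module_name):
    """Return True if a top-level module with this name can be located.

    Hypothetical helper: it only looks the module up, it does not import it.
    """
    return importlib.util.find_spec(module_name) is not None


# "clr" is the module name the tutorial code imports for Pythonnet.
if not bridge_installed("clr"):
    print("Pythonnet does not appear to be installed for this interpreter.")
```

If the message is printed, the pip command was probably run for a different Python installation than the one executing the script.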

Step 3: Include EasyXLS library in the project

EasyXLS.dll must be added to your project. After installing EasyXLS, EasyXLS.dll can be found in the "Dot NET version" folder.
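For a plain Python script, one way to "add" the DLL is to make its folder discoverable, since Pythonnet also searches the directories in sys.path when resolving clr.AddReference. The following is a minimal sketch under that assumption. The helper name add_assembly_dir is hypothetical, and the folder shown is a placeholder to be replaced with the actual location of the "Dot NET version" folder.

```python
import sys
from pathlib import Path


def add_assembly_dir(folder):
    """Append folder to sys.path once and return it as a string."""
    resolved = str(Path(folder))
    if resolved not in sys.path:
        sys.path.append(resolved)
    return resolved


# Placeholder path (assumption): point this at the real "Dot NET version" folder.
add_assembly_dir(r"C:\path\to\EasyXLS\Dot NET version")

# After this, the tutorial's clr.AddReference('EasyXLS') call can locate the DLL.
```

Copying EasyXLS.dll next to the script is an alternative that avoids touching sys.path.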

Step 4: Run Python code that imports data from Excel to DataSet

Execute the following Python code that imports Excel data to DataSet.

"""----------------------------------------------------------
Tutorial 34

This tutorial shows how to import Excel to DataSet in Python. 
The data is imported from the active sheet of the Excel file
(the Excel file generated in Tutorial 09).
----------------------------------------------------------"""

import clr
import gc

clr.AddReference('EasyXLS')
from EasyXLS import *

print("Tutorial 34\n-----------\n")

# Create an instance of the class that imports Excel files
workbook = ExcelDocument()

# Import Excel file to DataSet
print("Reading file C:\\Samples\\Tutorial09.xlsx.\n")
ds = workbook.easy_ReadXLSXActiveSheet_AsDataSet("C:\\Samples\\Tutorial09.xlsx")

# Display imported DataSet values
dt = ds.Tables[0]
for row in range(dt.Rows.Count):
    for column in range(dt.Columns.Count):
        print("At row " + str(row + 1) + ", column " + str(column + 1) +
            " the value is '" + str(dt.Rows[row].ItemArray[column]) + "'")

# Dispose memory
gc.collect()
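Once the DataSet is loaded, it is often more convenient to work with ordinary Python objects than with the .NET DataTable. The sketch below shows one possible conversion into a list of dictionaries. The helper name rows_as_dicts is hypothetical. The commented usage relies on the standard DataColumn.ColumnName and DataRow.ItemArray members and assumes the dt variable from the tutorial code above.

```python
def rows_as_dicts(column_names, item_arrays):
    """Pair each row's values with the column names.

    column_names: iterable of column headers
    item_arrays:  iterable of per-row value sequences
    """
    names = list(column_names)
    return [dict(zip(names, values)) for values in item_arrays]


# Usage with the DataTable from the tutorial (requires Pythonnet and EasyXLS):
# names = [dt.Columns[i].ColumnName for i in range(dt.Columns.Count)]
# arrays = (dt.Rows[r].ItemArray for r in range(dt.Rows.Count))
# records = rows_as_dicts(names, arrays)
```

Keeping the helper free of .NET types makes it easy to try out with plain lists before running it against a real workbook.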

EasyXLS on Linux, Mac, Windows using Java with Python

If you opt for the Java version of EasyXLS, code similar to the above requires Py4J, Pyjnius or any other bridge between Python and Java.

Step 1: Download and install EasyXLS Excel Library for Java

To download the trial version of EasyXLS Excel Library, use the button below:

Download EasyXLS™ Excel Library for Java

If you already own a license key, you may log in and download EasyXLS from your account.

Step 2: Install Py4j

To install Py4j, run the "pip" command as follows. Pip is a package-management system used to install and manage software packages written in Python.
<Python installation path>\Scripts>pip install "py4j.whl"

Step 3: Create additional Java program

The following Java code needs to be running in the background prior to executing the Python code.

import py4j.GatewayServer;

public class GatewayServerApp {
  public static void main(String[] args) {
    GatewayServerApp app = new GatewayServerApp();
    // app is now the gateway.entry_point
    GatewayServer server = new GatewayServer(app);
    server.start();
  }
}


Step 4: Add py4j library to CLASSPATH

py4j.jar must be added to the classpath of the additional Java program. After installing Py4j, py4j.jar can be found in the "<Python installation path>\share\py4j" folder.

Step 5: Add EasyXLS library to CLASSPATH

EasyXLS.jar must be added to the classpath of the additional Java program. After installing EasyXLS, EasyXLS.jar can be found in the "Lib" folder.
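The two classpath steps can be combined into a single java command line. The sketch below builds that command from Python. The helper name gateway_command is hypothetical, the jar file names are placeholders for the full paths on your machine, and it assumes GatewayServerApp has already been compiled into the directory passed as classes_dir.

```python
import os


def gateway_command(jars, main_class="GatewayServerApp", classes_dir="."):
    """Build a 'java -cp ...' command list for the gateway program.

    os.pathsep is ';' on Windows and ':' on Linux and Mac, which matches
    the separator the java launcher expects on each platform.
    """
    classpath = os.pathsep.join([classes_dir, *jars])
    return ["java", "-cp", classpath, main_class]


# Placeholder jar locations (assumption): use the real paths to both jars.
command = gateway_command(["py4j.jar", "EasyXLS.jar"])
print(" ".join(command))
```

The resulting list can be passed to subprocess.Popen if you prefer to start the gateway from a Python script rather than from a terminal.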

Step 6: Run additional Java program

Start the gateway server application. Starting it also starts the Java Virtual Machine implicitly.
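Because the Python code fails to connect if the gateway is not yet running, a quick reachability check can save some debugging. The sketch below is an illustration only: the helper name gateway_is_listening is hypothetical, and it assumes the gateway was started with Py4J's default port, 25333, on the local machine.

```python
import socket


def gateway_is_listening(host="127.0.0.1", port=25333, timeout=1.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Refused, timed out, or unreachable: treat all as "not listening".
        return False
```

A True result only shows that something accepts connections on that port; it does not prove the listener is the Py4J gateway.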

Step 7: Run Python code that imports data from Excel to ResultSet

Execute Python code like the following, which imports Excel data to ResultSet.

"""------------------------------------------------------------
Tutorial 34

This tutorial shows how to import Excel to ResultSet in Python.
The data is imported from the active sheet of the Excel file
(the Excel file generated in Tutorial 09).
------------------------------------------------------------"""

import gc

from py4j.java_gateway import JavaGateway
from py4j.java_gateway import java_import 
gateway = JavaGateway()

java_import(gateway.jvm,'EasyXLS.*')
java_import(gateway.jvm,'java.io.FileInputStream')

print("Tutorial 34\n-----------\n")

# Create an instance of the class that imports Excel files
workbook = gateway.jvm.ExcelDocument()

# Import Excel file to ResultSet
print("Reading file C:\\Samples\\Tutorial09.xlsx.\n")

file = gateway.jvm.FileInputStream("C:\\Samples\\Tutorial09.xlsx")
rs = workbook.easy_ReadXLSXActiveSheet_AsResultSet(file)

# Display imported ResultSet values
columnCount = rs.getMetaData().getColumnCount()

row = 0
while rs.next():
    for column in range(columnCount):
        print("At row " + str(row + 1) + ", column " + str(column + 1) +
              " the value is '" + str(rs.getString(column + 1)) + "'")
    row += 1

# Dispose memory
gc.collect()
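If the values are needed after the loop, the ResultSet can be copied into plain Python lists in a single pass. The sketch below is one way to do that. The helper name resultset_to_rows is hypothetical, and it uses only the ResultSet methods that already appear in the code above (getMetaData, next, getString). It must be called on a ResultSet that has not been iterated yet.

```python
def resultset_to_rows(rs):
    """Copy a JDBC-style ResultSet into a list of row lists.

    Column indexes are 1-based on the Java side, hence the range(1, n + 1).
    Empty cells may come back as None.
    """
    column_count = rs.getMetaData().getColumnCount()
    rows = []
    while rs.next():
        rows.append([rs.getString(i) for i in range(1, column_count + 1)])
    return rows


# Usage with the tutorial code (requires the running gateway):
# rows = resultset_to_rows(rs)
# print(len(rows), "rows imported")
```

Each getString call goes through the gateway, so for large sheets the copy can take noticeably longer than the same loop would in Java.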

EasyXLS Excel libraries:

.NET
Excel Library for Python .NET
full .NET version to import, export or convert Excel files
Excel Reader for Python .NET
.NET version to read and import Excel files
Java
Excel Library for Python Java
full Java version to import, export or convert Excel files
Excel Reader for Python Java
Java version to read and import Excel files
Download EasyXLS™ Excel Library for Python

File formats:

MS Excel 97 - 2003
MS Excel 2007 - 2010
MS Excel 2013
MS Excel 2016
MS Excel 2019
XLSX, XLSM, XLSB, XLS
XML, HTML, CSV, TXT