Skip to main content

Read Microsoft Excel Document As Relational Table Using Teiid

If you are thinking, who in this age of big data with Hadoop, MongoDB et.al one still bothers to fiddle around with Excel? In reality there are still lot of corporate users out there, do their analytics and reporting using Excel. So, this is still is very important source of data.

From very early versions of Teiid, Teiid supported consuming the data from Microsoft Excel documents, however it always relied upon the "Excel" ODBC driver that came built in Windows platforms. Teiid then used "jdbc-odbc" bridge driver provided by Oracle's JDK to read the data in. Although this solution worked most of the time, it was very tedious and sprinkled with issues and limited the access to Windows platform.

In latest version of Teiid 8.7, a new translator is introduced which is based Apache POI framework, which gives platform independent way to read Excel documents. This is worked out to be much more simple and also gave a way define the metadata for the data in Excel Documents.

If you are interested, I wrote a step-by-step example for this here https://community.jboss.org/wiki/MicrosoftExcelDocumentIntoRelationalTable

Ramesh..

Comments

  1. great! this will help a lot. great also for replacing odbc method... sometimes it's necessary (sometimes temporarily... for months) use xls spreadsheets to speed up things, while a proper app is developed. At least for us. Thanks for caring :)

    ReplyDelete

Post a Comment

Popular posts from this blog

Teiid Spring Boot 1.7.0 Released

Teiid Spring Boot version 1.7.0 to support Teiid 16.0 has been released. This release is mainly to support the Teiid's latest version.  In this release, the support for OpenAPI code generation based on VDB has been removed as there is no community interest and moreover it was at OpenAPI 2.0, and the industry has moved to 3.0 and beyond. There are no plans to further pursue this feature. VDB maven plugin was also removed, which was intended to be a replacement for the VDB importing feature was to be used when working on OpenShift, however, since it requires the Maven repository and does not completely represent the feature as defined on the WildFly based deployments this is also removed. You can still use the VDB import feature with Teiid Spring Boot, simply define the VDB with your "IMPORT DATABASE" statements and provide the additional files along with the main VDB file. During the start of the application, Teiid Spring Boot will load all the necessary DDL files for the

Teiid 8.13.3 Released

Teiid 8.13.3 is now  available .  In total 8.13.3 addresses 10 issues since 8.13.2: [ TEIID-4028 ] - adding salesforce-34 resource adapter does not work through the cli [ TEIID-4066 ] - Odata translator ClassNotFoundException: com.sun.ws.rs.ext.RuntimeDelegateImpl [ TEIID-4070 ] - Issues with resource adapters with api modules in wildfly [ TEIID-4089 ] - Teiid JDBC driver does not reset the update count when calling getMoreResults(int) [ TEIID-4093 ] - OData authentication fails with NPE when gss-pattern related properties are included in VDB [ TEIID-4096 ] - AssertionError with independent side of a dependent join that has an ordered limit [ TEIID-3050 ] - allow for more incremental insert with iterator [ TEIID-4075 ] - Netezza translator to support common table expressions [ TEIID-4098 ] - Always preserve columns order in google spreadsheets models [ TEIID-4046 ] - OData - $skip is beeing ignor

Expose Excel Data as OData feed using Teiid

For that matter you can expose any data source that is supported by Teiid through its translators as OData service. You can convert Relational databases XML, JSON, CSV files Web Services  Sales Force Excel documents sources, or write your on top of your specific source. Teiid added support for exposing its virtual database (VDB) as OData service without any additional work. Create a virtual database with your required sources and create any additional views and deploy in the the JBoss EAP 6.1 server, that has Teiid 8.3 or greater installed.  See https://docs.jboss.org/author/display/teiid84final/OData+Support  for more information. So, lets build a simple example of exposing the Excel Sheet. Teiid accesses Excel sheets via JDBC-ODBC bridge, and Windows OS provides a ODBC driver for Excel documents.  Edit the standalone-teiid.xml, and create Connection Factory jdbc:odbc:Driver={Microsoft Excel Driver (*.xls)};Dbq=c:\ODBC\ExcelData.xls odbc