Another Cassandra data manipulation api

Another Cassandra data manipulation api - PlayOrm

Recently i have found one interesting project on Github named PlayOrm, which features very impress me. I have decided to just play with it. Lets first check out there features list:

Just added support for Entity has a Cursor instead of List which is lazy read to prevent out of memory on VERY wide rows
PlayOrm Queries use way less resources from cassandra cluster than CQL queries
Scalabla JQL(SJQL) supported which is modified JQL that scales(SQL doesn't scale well)
Partitioning so you can query a one trillion row table in just ms with SJQL(Scalable Java Query Language)
Typical query support of <=, <, >, >= and = and no limitations here
Typical query support of AND and OR as well as parenthesis
Inner Join support (Must keep your very very large tables partitioned so you get very fast access times here)
Left Outer Join support
Return Database cursor on query
OneToMany, ManyToMany, OneToOne, and ManyToOne but the ToMany's are nosql fashion not like RDBMS
support of a findAll(Class c, List keys) as is typical in nosql to parallel the reads
Inheritance class heirarchy in one table is supported like hibernate
flush() support - We protect you from failures!!!
first level read cache
Automatically creates ColumnFamilies at runtime
Includes it's own in-memory database for TDD in your unit tests!!!!!
Saves you MORE data storage compared to other solutionst
logging interface below the first level cache so you can see the raw
operations on cassandra and optimize just like when you use hibernate's
logging
A raw interface using only BigDecimal, BigInteger, and String types
which is currently used to upload user defined datasets through a web
interface(and we wire that into generating meta data so they can ad-hoc
query on the nosql system)
An ad-hoc query interface that can query on any table that was from
an Entity object. To us on other tables, you can also code up and save
DboTableMeta objects and the ad-hoc query interface gets you query
support into those tables
IF you have some noSQL data and some Relational data, store your
relational data in noSQL now and just maintain one database in
production!!!
support for joda-time LocalDateTime, LocalDate, LocalTime which
works way better than java's Date object and is less buggy than java's
Date and Calendar objects
Command Line tool.

Impressive yah )) Feature 4 can Partitioning can replace Cassandra composite primary key feature. What i gave done - just clone the project from the git hub. Import the project in IntelliJ idea and start coding.

First i made a try to feature Inner Join.

1) Start my local Cassandra data base.
2) Create an Keyspace named MyKeyspace through CQL as follows:

CREATE KEYSPACE MyKeyspace WITH strategy_class='SimpleStrategy'
 AND strategy_options:replication_factor=1;

3) Create two simple java Pojo with PlayOrm annotations:

Entity log  - one to one relation with Entity event
@NoSqlEntity
@NoSqlQuery(name="findlog", query="select *  FROM Log as l INNER JOIN l.event as ee where l.user=:user")
public class Log {
    @NoSqlId
    private int id;
    //private String
    private String msg;
    @NoSqlIndexed
    private String user;
    @NoSqlTransient
    private Date   time;
    @NoSqlIndexed
    @NoSqlOneToOne
    private Event event;
    public int getId() {
        return id;
    }
    public void setId(int id) {
        this.id = id;
    }
    public String getMsg() {
        return msg;
    }
    public void setMsg(String msg) {
        this.msg = msg;
    }
    public String getUser() {
        return user;
    }
    public void setUser(String user) {
        this.user = user;
    }
    public Date getTime() {
        return time;
    }
    public void setTime(Date time) {
        this.time = time;
    }
    public Event getEvent() {
        return event;
    }
    public void setEvent(Event event) {
        this.event = event;
    }
}

Entity Event

import com.alvazan.orm.api.base.anno.NoSqlEntity;
import com.alvazan.orm.api.base.anno.NoSqlId;
import com.alvazan.orm.api.base.anno.NoSqlIndexed;
@NoSqlEntity
public class Event {
    @NoSqlId
    private int id;
    @NoSqlIndexed
    private String code;
    private String name;
    //private Log log;
    public int getId() {
        return id;
    }
    public void setId(int id) {
        this.id = id;
    }
    public String getCode() {
        return code;
    }
    public void setCode(String code) {
        this.code = code;
    }
    public String getName() {
        return name;
    }
    public void setName(String name) {
        this.name = name;
    }
}

for quick start better to use PlayOrm FactorySingleton which you can found it the test package

package com.alvazan.test;
import java.util.HashMap;
import java.util.Map;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;
import com.alvazan.orm.api.base.Bootstrap;
import com.alvazan.orm.api.base.DbTypeEnum;
import com.alvazan.orm.api.base.NoSqlEntityManagerFactory;
public class FactorySingleton {
private static final Logger log = LoggerFactory.getLogger(FactorySingleton.class);
private static NoSqlEntityManagerFactory factory;

public static Config getConfigForAllTests() {
/**************************************************
* FLIP THIS BIT TO CHANGE FROM CASSANDRA TO ANOTHER ONE
**************************************************/
String clusterName = "Test Cluster";
//DbTypeEnum serverType = DbTypeEnum.IN_MEMORY;
    DbTypeEnum serverType = DbTypeEnum.CASSANDRA;
String seeds = "localhost:9160";

return new Config(serverType, clusterName, seeds);
}

public synchronized static NoSqlEntityManagerFactory createFactoryOnce() {
if(factory == null) {
Config config = getConfigForAllTests();
//We used this below commented out seeds to test our suite on a cluster of 6 nodes to see if any issues pop up with more
//nodes using the default astyanax consistency levels which I believe for writes and reads are both QOURUM
//which is perfect for us as we know we will get the latest results
//String seeds = "a1.bigde.nrel.gov:9160,a2.bigde.nrel.gov:9160,a3.bigde.nrel.gov:9160";
Map<string object="object"< props = new HashMap<string object="object">();
factory = createFactory(config, props);
}
return factory;
}

public static NoSqlEntityManagerFactory createFactory(Config config, Map<string object="object"> props) {
log.info("CREATING FACTORY FOR TESTS");
props.put(Bootstrap.AUTO_CREATE_KEY, "create");
switch (config.getServerType()) {
case IN_MEMORY:
//nothing to do
break;
case CASSANDRA:
Bootstrap.createAndAddBestCassandraConfiguration(props, config.getClusterName(), "MyKeyspace", config.getSeeds());
break;
default:
throw new UnsupportedOperationException("not supported yet, server type="+config.getServerType());
}

NoSqlEntityManagerFactory factory = Bootstrap.create(config.getServerType(), props, null, null);
return factory;
}
}

Now it's time to put some data on Cassandra and write Managed query

package com.alvazan.test;

import com.alvazan.orm.api.base.NoSqlEntityManager;
import com.alvazan.orm.api.base.NoSqlEntityManagerFactory;
import com.alvazan.orm.api.base.Query;
import com.alvazan.test.db.Email;
import com.alvazan.test.db.User;
import com.alvazan.test.mytest.Event;
import com.alvazan.test.mytest.Log;

import java.util.ArrayList;
import java.util.Date;
import java.util.List;
public class BasicTest {
    public static void main(String[] args) {
        // create connection factory
        NoSqlEntityManagerFactory factory = FactorySingleton.createFactoryOnce();
        NoSqlEntityManager mgr = factory.createEntityManager();
        Event event = new Event();
        event.setCode("SID0001");
        event.setId(1);
        event.setName("Validation failed");
        Log log = new Log();
        log.setId(1);
        log.setTime(new Date(System.currentTimeMillis()));
        log.setMsg("test");
        log.setUser("weblogic");
        log.setEvent(event);
  
        mgr.put(log);
        mgr.flush();
        // query
        Query query = mgr.createNamedQuery(Log.class, "findlog");
        query.setParameter("user","weblogic");
        List l = query.getResultList(0,100);
        System.out.println("Result Size: "+ l.size());
}

For partitioning query you have to defined managed query similarly

PARTITIONS e(:partitionId) select * FROM TABLE as e WHERE e.user = :user

Most of the example with Cassandra you will found on the com.alvazan.test package. At first glance the framework is very impressive with lot of unique features. For me it will be useful to reindex or create new index from existing data through map/reduce. This feature is in their up coming features list. I will be very happy to see the feature in next version.

Quick start with In memory Data Grid, Apache Ignite

UP1: For complete quick start guide, see also the sample chapter of the book "High performance in-memory computing with Apache Ignite" here . Even you can find the sample examples from the GitHub repository . IMDG or In memory data grid is not an in-memory relational database, an NoSQL database or a relational database. It is a different breed of software datastore. The data model is distributed across many servers in a single location or across multiple locations. This distribution is known as a data fabric. This distributed model is known as a ‘shared nothing’ architecture. IMDG has following characteristics: All servers can be active in each site. All data is stored in the RAM of the servers. Servers can be added or removed non-disruptively, to increase the amount of RAM available. The data model is non-relational and is object-based. Distributed applications written on the platform independent language. The data fabric is resilient, allowing non-disruptive au...

My workspace

Search This Blog

Another Cassandra data manipulation api - PlayOrm

Labels

Comments

Popular posts from this blog

Apache Ignite deep dive, SQL engine

Book review: High Performance in-memory computing with Apache Ignite by Sadruddin Md

Quick start with In memory Data Grid, Apache Ignite