Choosing Java vs Python on Google App Engine

Choosing Java vs Python on Google App Engine



Currently Google App Engine supports both Python & Java. Java support is less mature. However, Java seems to have a longer list of libraries and especially support for Java bytecode regardless of the languages used to write that code. Which language will give better performance and more power? Please advise. Thank you!



Edit:
http://groups.google.com/group/google-appengine-java/web/will-it-play-in-app-engine?pli=1



Edit:
By "power" I mean better expandability and inclusion of available libraries outside the framework. Python allows only pure Python libraries, though.





now Google App Engine is supporting Go (experimental). What's your toughts about that ?
– Benjamin Crouzier
Jul 15 '11 at 10:52





@pinouchon I've started using Go, and deployed that in production on GAE. GO works very well on GAE, compiles in a few seconds. Choose your web framework wisely :-)
– Michele Giuseppe Fadda
Nov 25 '15 at 20:41




15 Answers
15



I'm biased (being a Python expert but pretty rusty in Java) but I think the Python runtime of GAE is currently more advanced and better developed than the Java runtime -- the former has had one extra year to develop and mature, after all.



How things will proceed going forward is of course hard to predict -- demand is probably stronger on the Java side (especially since it's not just about Java, but other languages perched on top of the JVM too, so it's THE way to run e.g. PHP or Ruby code on App Engine); the Python App Engine team however does have the advantage of having on board Guido van Rossum, the inventor of Python and an amazingly strong engineer.



In terms of flexibility, the Java engine, as already mentioned, does offer the possibility of running JVM bytecode made by different languages, not just Java -- if you're in a multi-language shop that's a pretty large positive. Vice versa, if you loathe Javascript but must execute some code in the user's browser, Java's GWT (generating the Javascript for you from your Java-level coding) is far richer and more advanced than Python-side alternatives (in practice, if you choose Python, you'll be writing some JS yourself for this purpose, while if you choose Java GWT is a usable alternative if you loathe writing JS).



In terms of libraries it's pretty much a wash -- the JVM is restricted enough (no threads, no custom class loaders, no JNI, no relational DB) to hamper the simple reuse of existing Java libraries as much, or more, than existing Python libraries are similarly hampered by the similar restrictions on the Python runtime.



In terms of performance, I think it's a wash, though you should benchmark on tasks of your own -- don't rely on the performance of highly optimized JIT-based JVM implementations discounting their large startup times and memory footprints, because the app engine environment is very different (startup costs will be paid often, as instances of your app are started, stopped, moved to different hosts, etc, all trasparently to you -- such events are typically much cheaper with Python runtime environments than with JVMs).



The XPath/XSLT situation (to be euphemistic...) is not exactly perfect on either side, sigh, though I think it may be a tad less bad in the JVM (where, apparently, substantial subsets of Saxon can be made to run, with some care). I think it's worth opening issues on the Appengine Issues page with XPath and XSLT in their titles -- right now there are only issues asking for specific libraries, and that's myopic: I don't really care HOW a good XPath/XSLT is implemented, for Python and/or for Java, as long as I get to use it. (Specific libraries may ease migration of existing code, but that's less important than being able to perform such tasks as "rapidly apply XSLT transformation" in SOME way!-). I know I'd star such an issue if well phrased (especially in a language-independent way).



Last but not least: remember that you can have different version of your app (using the same datastore) some of which are implemented with the Python runtime, some with the Java runtime, and you can access versions that differ from the "default/active" one with explicit URLs. So you could have both Python and Java code (in different versions of your app) use and modify the same data store, granting you even more flexibility (though only one will have the "nice" URL such as foobar.appspot.com -- which is probably important only for access by interactive users on browsers, I imagine;-).





GWT is primarily a client side technology - you can use it regardless of whether your back end is python or java. You lose a bit of syntactic sugar by having to do rpc over JSON rather than GWT's built in RPC, but if you hate JS and do python it's still worth a look :)
– Peter Recore
Jul 6 '09 at 18:56





There is Pyjamas (pyjs.org) as a Pythonic alternative to GWT - it will take Python code and compile it to Javascript, just as GWT does for Java code.
– Dave Kirby
Apr 14 '10 at 9:26





Just to give a "5 years later" perspective. As a Java Developer I feel like GAE is running an outdated stack. You will not find Java 8 support, (they are running Java 6 as well as legacy Jetty 6 container with Servlet API 2.5), all in all Java Support in GAE is still stuck in 2010. While I love GAE simplicity and Google Powerful Services, I can't recommend GAE for Java until they upgrade its stack.
– Anthony Accioly
Sep 19 '14 at 19:06



Watch this app for changes in Python and Java performance:



http://gaejava.appspot.com/
(edit: apologies, link is broken now. But following para still applied when I saw it running last)



Currently, Python and using the low-level API in Java are faster than JDO on Java, for this simple test. At least if the underlying engine changes, that app should reflect performance changes.





With all due respect, I find this test simple enough to be meaningless. For what it is worth... If you do use Java/GAE, I recommend using the Low level API and avoid JDO or any other framework. More importantly, JDO gives you the 'feeling' you're working with a relational database, which can be 'misleading'.
– Mo'in Creemers
Dec 11 '10 at 12:26






I agree about avoiding JDO, partly for the reason you mention but also because it's slower than low-level. (Which the test generally shows.) It simply hints that there are performance differences, so either don't use JDO or test for your specific task. I've moved all my own code from JDO and the low-level API to Objectify. And in any case, it also shows that Python certainly isn't the poor cousin of performance on GAE.
– Richard Watson
Dec 12 '10 at 8:35






Your app, it is throwing Internal Server Error.
– tomdemuyt
Apr 19 '12 at 18:54





Thanks Tom. Not my app, sadly. Have mailed someone that might be linked.
– Richard Watson
Apr 25 '12 at 20:12






i would like to see how objectify compares in this test
– Moshe Shaham
Jan 23 '14 at 13:57



Based on experience with running these VMs on other platforms, I'd say that you'll probably get more raw performance out of Java than Python. Don't underestimate Python's selling points, however: The Python language is much more productive in terms of lines of code - the general agreement is that Python requires a third of the code of an equivalent Java program, while remaining as or more readable. This benefit is multiplied by the ability to run code immediately without an explicit compile step.



With regards to available libraries, you'll find that much of the extensive Python runtime library works out of the box (as does Java's). The popular Django Web framework (http://www.djangoproject.com/) is also supported on AppEngine.



With regards to 'power', it's difficult to know what you mean, but Python is used in many different domains, especially the Web: YouTube is written in Python, as is Sourceforge (as of last week).





Thank you Judy2K! By power I mean can do more things and easy to extend.
– Viet
Jul 6 '09 at 9:59



June 2013: This video is a very good answer by a google engineer:



http://www.youtube.com/watch?v=tLriM2krw2E



TLDR; is:



An important question to consider in deciding between Python and Java is how you will use the datastore in each language (and most other angles to the original question have already been covered quite well in this topic).



For Java, the standard method is to use JDO or JPA. These are great for portability but are not very well suited to the datastore.



A low-level API is available but this is too low level for day-to-day use - it is more suitable for building 3rd party libraries.



For Python there is an API designed specifically to provide applications with easy but powerful access to the datastore. It is great except that it is not portable so it locks you into GAE.



Fortunately, there are solutions being developed for the weaknesses listed for both languages.



For Java, the low-level API is being used to develop persistence libraries that are much better suited to the datastore then JDO/JPA (IMO). Examples include the Siena project, and Objectify.



I've recently started using Objectify and am finding it to be very easy to use and well suited to the datastore, and its growing popularity has translated into good support. For example, Objectify is officially supported by Google's new Cloud Endpoints service. On the other hand, Objectify only works with the datastore, while Siena is 'inspired' by the datastore but is designed to work with a variety of both SQL databases and NoSQL datastores.



For Python, there are efforts being made to allow the use of the Python GAE datastore API off of the GAE. One example is the SQLite backend that Google released for use with the SDK, but I doubt they intend this to grow into something production ready. The TyphoonAE project probably has more potential, but I don't think it is production ready yet either (correct me if I am wrong).



If anyone has experience with any of these alternatives or knows of others, please add them in a comment. Personally, I really like the GAE datastore - I find it to be a considerable improvement over the AWS SimpleDB - so I wish for the success of these efforts to alleviate some of the issues in using it.



I'm strongly recommending Java for GAE and here's why:



I'm using Java/GAE in development right now.





Thanks Paul for recommendation.
– Viet
Jul 8 '09 at 1:31





@Paul - could you recommend (or give links to) the best way to handle persistence using Java on GAE if JDO is not the way to go?
– Mark
Aug 30 '10 at 12:25





@Mark, I'm sorry for delay. I think code.google.com/p/objectify-appengine is the best choose for now.
– Paul
Sep 7 '10 at 14:32





-1 for the outright falsehoods in points #2 and #3 and for #4 not making any sense.
– Triptych
Mar 8 '11 at 16:10





@Paul if you wanted me to consider those things as part of your answer, you should have included them in your answer. Instead, you including a string of half-truths. No one chooses a language based on the corner cases you're coming up with now.
– Triptych
Mar 9 '11 at 17:53



As you've identified, using a JVM doesn't restrict you to using the Java language. A list of JVM languages and links can be found here. However, the Google App Engine does restrict the set of classes you can use from the normal Java SE set, and you will want to investigate if any of these implementations can be used on the app engine.



EDIT: I see you've found such a list



I can't comment on the performance of Python. However, the JVM is a very powerful platform performance-wise, given its ability to dynamically compile and optimise code during the run time.



Ultimately performance will depend on what your application does, and how you code it. In the absence of further info, I think it's not possible to give any more pointers in this area.





Thanks for the prompt reply, Brian. I intend to focus application on url fetching and XML parsing & XSLT processing. There will be less of serving HTTP content to browsers.
– Viet
Jul 6 '09 at 8:16



I've been amazed at how clean, straightforward, and problem free the Python/Django SDK is. However I started running into situations where I needed to start doing more JavaScript and thought I might want to take advantage of the GWT and other Java utilities. I've gotten just half way through the GAE Java tutorial, and have had one problem after another: Eclipse configuration issues, JRE versionitis, the mind-numbing complexity of Java, and a confusing and possibly broken tutorial. Checking out this site and others linked from here clinched it for me. I'm going back to Python, and I'll look into Pyjamas to help with my JavaScript challenges.



I'm a little late to the conversation, but here are my two cents. I really had a hard time choosing between Python and Java, since I am well versed in both languages. As we all know, there are advantages and disadvantages for both, and you have to take in account your requirements and the frameworks that work best for your project.



As I usually do in this type of dilemmas, I look for numbers to support my decision. I decided to go with Python for many reasons, but in my case, there was one plot that was the tipping point. If you search "Google App Engine" in GitHub as of September 2014, you will find the following figure:



GAE Language Stats



There could be many biases in these numbers, but overall, there are three times more GAE Python repositories than GAE Java repositories. Not only that, but if you list the projects by the "number of stars" you will see that a majority of the Python projects appear at the top (you have to take in account that Python has been around longer). To me, this makes a strong case for Python because I take in account community adoption & support, documentation, and availability of open-source projects.



It's a good question, and I think many of the responses have given good view points of pros and cons on both sides of the fence. I've tried both Python and JVM-based AppEngine (in my case I was using Gaelyk which is a Groovy application framework built for AppEngine). When it comes to performance on the platform, one thing I hadn't considered until it was staring me in the face is the implication of "Loading Requests" that occur on the Java side of the fence. When using Groovy these loading requests are a killer.



I put a post together on the topic (http://distractable.net/coding/google-appengine-java-vs-python-performance-comparison/) and I'm hoping to find a way of working around the problem, but if not I think I'll be going back to a Python + Django combination until cold starting java requests has less of an impact.



Based on how much I hear Java people complain about AppEngine compared to Python users, I would say Python is much less stressful to use.





I heard that Ford owners are complaining about their cars a lot more than Koenigsegg owners. Why could that be?
– Axarydax
Dec 17 '10 at 13:13



There's also project Unladen Swallow, which is apparently Google-funded if not Google-owned. They're trying to implement a LLVM-based backend for Python 2.6.1 bytecode, so they can use a JIT and various nice native code/GC/multi-core optimisations. (Nice quote: "We aspire to do no original work, instead using as much of the last 30 years of research as possible.") They're looking for a 5x speed-up to CPython.



Of course this doesn't answer your immediate question, but points towards a "closing of the gap" (if any) in the future (hopefully).





Thanks :) I'll have a look at that!
– Viet
Aug 30 '09 at 6:36





Unladen Swallow is now a dead project and the last commit is over a year old.
– Tshepang
Nov 9 '11 at 17:37



The beauty of python nowdays is how well it communicates with other languages. For instance you can have both python and java on the same table with Jython. Of course jython even though it fully supports java libraries it does not support fully python libraries. But its an ideal solution if you want to mess with Java Libraries. It even allows you to mix it with Java code with no extra coding.



But even python itself has made some steps forwared. See ctypes for example, near C speed , direct accees to C libraries all of this without leaving the comfort of python coding. Cython goes one step further , allowing to mix c code with python code with ease, or even if you dont want to mess with c or c++ , you can still code in python but use statically type variables making your python programms as fast as C apps. Cython is both used and supported by google by the way.



Yesterday I even found tools for python to inline C or even Assembly (see CorePy) , you cant get any more powerful than that.



Python is surely a very mature language, not only standing on itself , but able to coooperate with any other language with easy. I think that is what makes python an ideal solution even in a very advanced and demanding scenarios.



With python you can have acess to C/C++ ,Java , .NET and many other libraries with almost zero additional coding giving you also a language that minimises, simplifies and beautifies coding. Its a very tempting language.





The question is about java vs python on GAE, which has a lot of restrictions. Hence, your arguments are inapplicable.
– Daniyar
Aug 3 '10 at 4:53





I agree with @Daniyar, that this answer is a bit ( or perhaps totally) off the beat, but I like the answer and this was something I was looking for. Thanks Kilon for sharing this knowledge. May be this was the wrong place, but you certainly did some knowledge sharing. +1 and kudos for that.
– zeFree
Apr 13 '12 at 1:58



Gone with Python even though GWT seems a perfect match for the kind of an app I'm developing. JPA is pretty messed up on GAE (e.g. no @Embeddable and other obscure non-documented limitations). Having spent a week, I can tell that Java just doesn't feel right on GAE at the moment.



One think to take into account are the frameworks you intend yo use. Not all frameworks on Java side are well suited for applications running on App Engine, which is somewhat different than traditional Java app servers.



One thing to consider is the application startup time. With traditional Java web apps you don't really need to think about this. The application starts and then it just runs. Doesn't really matter if the startup takes 5 seconds or couple of minutes. With App Engine you might end up in a situation where the application is only started when a request comes in. This means the user is waiting while your application boots up. New GAE features like reserved instances help here, but check first.



Another thing are the different limitations GAE psoes on Java. Not all frameworks are happy with the limitations on what classes you can use or the fact that threads are not allowed or that you can't access local filesystem. These issues are probably easy to find out by just googling about GAE compatibility.



I've also seen some people complaining about issues with session size on modern UI frameworks (Wicket, namely). In general these frameworks tend to do certain trade-offs in order to make development fun, fast and easy. Sometimes this may lead to conflicts with the App Engine limitations.



I initially started developing working on GAE with Java, but then switched to Python because of these reasons. My personal feeling is that Python is a better choice for App Engine development. I think Java is more "at home" for example on Amazon's Elastic Beanstalk.



BUT with App Engine things are changing very rapidly. GAE is changing itself and as it becomes more popular, the frameworks are also changing to work around its limitations.




Thank you for your interest in this question.
Because it has attracted low-quality or spam answers that had to be removed, posting an answer now requires 10 reputation on this site (the association bonus does not count).



Would you like to answer one of these unanswered questions instead?

Popular posts from this blog

𛂒𛀶,𛀽𛀑𛂀𛃧𛂓𛀙𛃆𛃑𛃷𛂟𛁡𛀢𛀟𛁤𛂽𛁕𛁪𛂟𛂯,𛁞𛂧𛀴𛁄𛁠𛁼𛂿𛀤 𛂘,𛁺𛂾𛃭𛃭𛃵𛀺,𛂣𛃍𛂖𛃶 𛀸𛃀𛂖𛁶𛁏𛁚 𛂢𛂞 𛁰𛂆𛀔,𛁸𛀽𛁓𛃋𛂇𛃧𛀧𛃣𛂐𛃇,𛂂𛃻𛃲𛁬𛃞𛀧𛃃𛀅 𛂭𛁠𛁡𛃇𛀷𛃓𛁥,𛁙𛁘𛁞𛃸𛁸𛃣𛁜,𛂛,𛃿,𛁯𛂘𛂌𛃛𛁱𛃌𛂈𛂇 𛁊𛃲,𛀕𛃴𛀜 𛀶𛂆𛀶𛃟𛂉𛀣,𛂐𛁞𛁾 𛁷𛂑𛁳𛂯𛀬𛃅,𛃶𛁼

Edmonton

Crossroads (UK TV series)