aws-re:invent-2017

8 results

Production Ready IoT with MongoDB Stitch and Electric Imp: Part 1

It's that time of year again! This post is part of our Road to AWS re:Invent 2017 blog series. See all posts here. Introduction We’ve been tinkering with hardware a lot at MongoDB lately, and with this has come a lot of trial and error and first hand learnings about how hard building real IoT is. While sending a message or getting a light to blink is easy, the jump between that and building something production ready can be daunting. Really, what makes IoT hard is: Building, testing, and modifying hardware (especially if you’re only used to software) Establishing and maintaining a secure and scalable connection to the cloud in real-world environments Standing up a full backend to process, persist, and analyze IoT data Integrating all the other services you use without writing a lot of boring integration code Making sure all aspects of the application are robust and secure Even though MongoDB is the best database for IoT, there’s still a lot more to getting your project off the ground. We’ve found the following to be a killer combination to getting your end-to-end deployment running smoothly: Electric Imp – Flexible development boards with pre-integrated software, a polished IDE, and managed, secure cloud connectivity. MongoDB Stitch – A full backend as a service with integrations with your favorite services and built-in JavaScript functions. MongoDB Atlas – The best database for persisting and analyzing your IoT data. In the following tutorial we’re going to put all these together: Use an impExplorer™ Developer Kit and the Electric Imp platform to read the temperature and send it to Stitch. Stitch combines the temperature readings with more general weather data from Dark Sky, stores it in Atlas, and protects your data with field-level access rules. Stitch also integrates with Twilio so you can retrieve temperature data from the database via text. Getting Started with your Electric Imp Device For this tutorial you will need to pick up Electric Imp’s impExplorer™ Developer Kit . This is what you will use to read the temperature and send it to our backend. Before you get set up, you’ll need to make an account for Electric Imp’s IDE and download the Electric Imp app (on an iOS or Android phone). Your account is where the device code will live, and you’ll use the app to link your device to your account. Setting up your imp To start, unpack your impExplorer. It should include a board, a card (a WiFi IoT module), and a USB power cord. Start by inserting the card into the board and powering it on by attaching the provided cable to your board and then plugging it into a USB port on your computer or a USB wall adaptor. If you are having trouble or want more information, see the impExplorer guide . After you power on your board, use the Electric Imp app to ‘BlinkUp’ your board. This connects it to the internet and assigns it to your account so that you can deploy code to it. If you wait too long to configure your board – you know this when the card’s LED stops flashing – simply restart it and try again. If you have issues blinking up your impExplorer you can check out Electric Imp’s troubleshooting guide . Deploying the Code Now that you’ve got your device blinked up, you’ll want to build out your device code quickly. Electric Imp actually breaks down the code into two components: Device code reads inputs from the sensors on the impExplorer Agent code ensures that the device communicates reliably and securely with the internet. We’ve provided code that covers both the device and agent code for the impExplorer. You can find them with our other Github samples . Once you have downloaded the code, sign into the Electric Imp IDE . From there you’ll need to do the following: Create a model and link your Device : After signing in, the impExplorer device should appear under devices as an unassigned device. Create a model by going to Models and clicking the +. Assign your device to the new model (and rename your device if you choose). Create a model and link your Device : After signing in, the impExplorer device should appear under devices as an unassigned device. Create a model by going to Models and clicking the +. Assign your device to the new model (and rename your device if you choose). If you have any issues with the above process, see the tour of the Imp IDE . Can’t wait for hardware and don’t mind cheating a little bit? We’ve got a node script that fakes its place of the demo. But it’s so much more fun with the actual device ! Connecting your Device to Stitch Now you have a device that can measure and send the temperature, but nowhere to send it to. Luckily, with MongoDB Stitch you can quickly setup an entire backend. Getting Started with Atlas To start, if you don’t already have a MongoDB Atlas account then you’ll want to register for an account and create an Atlas cluster . Create your Stitch Application Once you have Atlas up and running, you can create a Stitch app to cover our backend connections and logic. To create an app, you can click on “Stitch Apps” on the left hand nav of the UI and then click “Create New Application” or follow the more in depth instructions on how to create a MongoDB Stitch app . With Stitch, all of your data is protected by default, so you’ll have to do the following to grant your device the ability to write through Stitch: Create an API Key so the device authenticate to your Stitch application Put Rules in place such that devices can read/write data properly Create a function for the device to execute when it wants to write data Setting up Authentication To start, we’re going to enable API key authentication in the application. To do this: First go to the Stitch UI From there click on ‘Authentication’ in the left hand menu Find ‘API Keys’ under ‘Providers’ and click ‘Edit’ Enable API Key authentication, and then create an API key (the name doesn’t matter). Note : You can only view a key once, so if you lose this key you’ll have to generate a new one. After you create and copy the key, go back to the Electric Imp IDE and add your App ID (found on the Stitch ‘Getting Started’ or ‘Clients’ page) and you API key to the code as shown below. Note : In order to fully roll out this code you will need to ‘Build’ it by clicking the ‘Build’ button at the top of the IDE, and you may need to restart your device as well. Now your device will be able to make authenticated requests to Stitch. However, you will still need to create rules and a Stitch Function to get everything working properly. Creating Rules One important thing to note with Stitch is that even when you’re authenticated, all access to your data and services is off by default. In order to access anything you must use Stitch’s rules to enable access. Therefore, in order for our function that writes to Stitch to work properly, you will need to set a Write rule for the namespace. In order to do this: Click on your MongoDB Atlas cluster under ‘Atlas Clusters’ on the left-hand menu. In the ‘Rules’ tab, click ‘New’ to add a new collection to your rules, and enter ‘Imp’ for the database and the ‘TempData’ for the collection. Click the created collection to edit the following details: Click the ‘Filter’ tab and delete the existing ‘Filter’ Select ‘Top-Level Document’ to adjust the rules for the namespace as a whole Set the write rule to {} Delete the read rule entirely, leaving an empty box. By doing this, you enable the device (and only the device) to write data and prevent all reads to the data by devices. Creating a Stitch Function for Writing Now that your device has a user and permissions associated with it, it’s time to define the function that it will use to write to the database. Check out the code in the ‘Imp_Write.js’ file. This contains the JavaScript function that the ImpExplorer is calling in its agent code. With Functions, this code can be hosted and executed by Stitch. To add it to your application: In the Stitch UI, click ‘Functions’ on the lefthand menu Click ‘New Function’ In the ‘Settings’ tab, set your Function Name to “Imp_Write” and your ‘Can Evaluate’ rule to {}. Go to the ‘Function Editor’ tab and paste the code from Imp_Write.js into the editor. Then click the save icon on the upper left-hand corner of the Editor. Now that you’ve got the basic write functionality set up, let’s look at how Stitch’s built-in services and functions can help broaden the scope of your app. Building out your Backend In this section we’ll show you how you can integrate with Dark Sky to use its public weather API to incorporate additional data and with Twilio to search our data via text message. Additional Weather data with Dark Sky One of Dark Sky’s many services is providing a public API that serves real-time weather data. Here, you will use this service to pair real-time weather data with our device’s temperature and humidity readings as they are loaded. If you don’t have a Dark Sky account you can register for one here . Once you register, you’ll need to copy down your API key as you will incorporate it into Stitch shortly. To start, you will set up two values in Stitch. You can hard code things like this in your functions, but using Values is prefered as it makes them accessible for reuse. In the Stitch UI, go to ‘Values’ found on the left-hand menu Create two new values, clicking ‘Save’ after each is set up: DeviceLocation: Enter the longitude/latitude of where your device is located in the form "40.757,-73.987" – example for Times Square in New York City. DarkSkyKey: Enter the API key that you received from Dark Sky. Next, create a HTTP service to make the actual Dark Sky request. To do this: In the Stitch UI, click ‘Services’, then click ‘HTTP’, and name your service ‘darksky’ before creating After creation, click into the ‘Rules’ tab, and then click ‘New’. The name of this rule does not matter. To create a rule that enables GET requests, click on the ‘GET’ action and set this rule to {}. Since the code that you will be using to call the Dark Sky API is already in in the Imp_Write Function, all you need to do now is uncomment the lines that call Dark Sky. They should be the only commented out lines containing code. Now, when your impExplorer loads data Stitch will automatically combine it with data from the Dark Sky API. Searching your Data with Twilio Now let’s take a look at serving your data. Since you are using MongoDB Atlas behind the scenes you can always connect to your database and query it directly. However, today we’ll also show you how you can query your database via Twilio + Stitch. If you don’t have a Twilio account and an SMS-enabled phone number, you can learn more and create one here . Once you have your Twilio account set up, keep your Account SID and Auth Token handy, as you will use them to configure your Twilio service. To do this: Go to the Stitch UI and click on ‘Services’ Click ‘Twilio’, assign the service the name ‘twilio’, and then add your SID and Token. Once your Twilio service is configured, set up a rule that enables Stitch to send text messages. To do this: Click into the ‘Rules’ tab of your Twilio service Click ‘New Rules’, assign it any name that you like, and then click ‘Add Rule’ Now click into the rule and click the ‘send’ to enable it. You will also need to store your Twilio phone number as a Value named ‘TwilioPhone’ in Stitch. To do this: In the Stitch UI, Navigate to ‘Values’ using the left-hand menu. Click ‘New’ value and add your Twilio number in the form “+15558675309” before clicking ‘Save’. After that is taken care of you are going to set up an incoming webhook that will parse a text message from Twilio, use it to search the database, and then send a return text message with weather information for a specific time. This code is contained in the twilio_webhook.js file. To add this code to Stitch: In the Stitch UI, navigate to your Twilio service Under the ‘Incoming Webhooks’ tab click ‘+ Add Incoming Webhook’ Under the ‘settings’ tab assign your webhook a name (exact naming is not important) and enable ‘Respond with Result’ by clicking on it Then, in the Function Editor, paste the code from the twilio_find.js file and click the save icon in the upper left-hand corner. Before moving on, copy down the ‘Webhook URL’ under the ‘Settings’ tab. You’ll need to add this to Twilio in our final step. Now that the webhook is set up in Stitch, make sure that Twilio will call it correctly. In order to do this: Go to your Twilio Console and navigate to ‘Phone Numbers’ using the left-hand menu. Click the number that you are using for this demo, then go to the ‘Messaging’ section at the bottom of the page. Make sure that ‘Configure With’ is set to ‘Webhooks,...’ and that when ‘A Message Comes In’ is configured to execute your Twilio webhook by selecting ‘Webhook’ and pasting your Webhook URL. Save any changes before navigating away. Now Twilio will only be able to access data via the webhook that it is assigned, meaning that it doesn’t change how secure your app is. Test everything out! Now that you’ve got everything up and running, try querying your data via Twilio by sending a message to your Twilio number in one of the following formats: Send “Temp” or “Temp Now” to get the current temperature. Or search for a specific time by sending “Temp [Time]” where [Time] is in the format “YYYY-MM-DDThh:mm” (ex. “Temp 2017-11-27T11:20”) Summary Both MongoDB Stitch and Electric Imp offer you a big leg up when getting your IoT project off the ground. Now that you’ve gone through the basics with Stitch and Electric Imp, you can move onto some of our other projects like building a real-time dashboard or build your next great idea. As always, we’re interested to hear about what you’re building. Please continue the conversation by commenting on this post or send me a note at drew.dipalma@mongodb.com to share what you’re working on!

November 28, 2017

Using Amazon Lex, Lambda, & MongoDB Atlas to Build a Voice-Activated Movie Search App - Part 3

It's that time of year again! This post is part of our Road to AWS re:Invent 2017 blog series. In the weeks leading up to AWS re:Invent in Las Vegas this November, we'll be posting about a number of topics related to running MongoDB in the public cloud. See all posts here. Introduction This is Part 3 of our Amazon Lex blog post series, part of our larger Road to re:Invent 2017 series . As a reminder, this tutorial is divided into 3 parts: Part 1: Lex overview, demo scenario and data layer setup Part 2: Set up and test an Amazon Lex bot Part 3: Deploy a Lambda function as our Lex bot fulfillment (this blog post) In this last blog post, we will deploy our Lambda function using the AWS Command Line Interface and verify that the bot fully works as expected. We’ll then review the code that makes up our Lambda function and explain how it works. Let’s deploy our AWS Lambda function Please follow the deployment steps available in this GitHub repository . I have chosen to use Amazon’s SAM Local tool to showcase how you can test your Lambda function locally using Docker , as well as package it and deploy it to an AWS account in just a few commands. However, if you’d like to deploy it manually to the AWS Console, you can always use this zip script to deploy it in pretty much the same way I did in this MongoDB Atlas with Lambda tutorial . Let’s test our Lex bot (end-to-end) Now that our Lambda fulfillment function has been deployed, let’s test our bot again in the Amazon Lex console and verify that we get the expected response. For instance, we might want to search for all the romance movies Jennifer Aniston starred in, a scenario we can test with the following bot conversation: As the screenshot above testifies, the Lex bot replies with the full list of Jennifer Aniston’s romance movies retrieved from our movies MongoDB database through our Lambda function. But how does our Lambda function process that request? We’ll dig deeper into our Lambda function code in the next section. Let's dive into the Lambda function code Our Lambda function always receives a JSON payload with a structure compliant with Amazon Lex’ input event format (as this event.json file is): { "messageVersion": "1.0", "invocationSource": "FulfillmentCodeHook", "userId": "user-1", "sessionAttributes": {}, "bot": { "name": "SearchMoviesBot", "alias": "$LATEST", "version": "$LATEST" }, "outputDialogMode": "Text", "currentIntent": { "name": "SearchMovies", "slots": { "castMember": "jennifer aniston", "year": "0", "genre": "Romance" } } } Note that the request contains the bot’s name ( SearchMoviesBot ) and the slot values representing the answers to the bot’s questions provided by the user. The Lambda function starts with the exports.handler method which validates the bot’s name and performs some additional processing if the payload is received through Amazon API Gateway (this is only necessary if you want to test your Lambda function through Amazon API Gateway but is not relevant in an Amazon Lex context). It then calls the dispatch() method, which takes care of connecting to our MongoDB Atlas database and passing on the bot’s intent to the query() method, which we’ll explore in a second. Note that the dispatch() method uses the performance optimization technique I highlighted in Optimizing AWS Lambda performance with MongoDB Atlas and Node.js , namely not closing the database connection and using the callbackWaitsForEmptyEventLoop Lambda context property. This allows our bot to be more responsive after the first query fulfilled by the Lambda function. Let’s now take a closer look at the query() method, which is the soul and heart of our Lambda function. First, that method retrieves the cast member, movie genre, and movie release year. Because these values all come as strings and the movie release year is stored as an integer in MongoDB, the function must convert that value to an integer . We then build the query we will run against MongoDB: var castArray = [castMember]; var matchQuery = { Cast: { $in: castArray }, Genres: { $not: { $in: ["Documentary", "News", ""] } }, Type: "movie" }; if (genre != undefined && genre != allGenres) { matchQuery.Genres = { $in: [genre] }; msgGenre = genre.toLowerCase(); } if ((year != undefined && isNaN(year)) || year > 1895) { matchQuery.Year = year; msgYear = year; } We first restrict the query to items that are indeed movies (since the database also stores TV series) and we exclude some irrelevant movie genres such as the documentary and news genres. We also make sure we only query movies in which the cast member starred. Note that the $in operator expects an array, which is why we have to wrap our unique cast member into the castArray array. Since the cast member is the only mandatory query parameter, we add it first and then optionally add the Genres and Year parameters if the code determines that they were provided by the user (i.e. the user did not use the All and/or 0 escape values). The query() method then goes on to define the default response message based on the user-provided parameters. This default response message is used if the query doesn’t return any matching element: var resMessage = undefined; if (msgGenre == undefined && msgYear == undefined) { resMessage = `Sorry, I couldn't find any movie for ${castMember}.`; } if (msgGenre != undefined && msgYear == undefined) { resMessage = `Sorry, I couldn't find any ${msgGenre} movie for ${castMember}.`; } if (msgGenre == undefined && msgYear != undefined) { resMessage = `Sorry, I couldn't find any movie for ${castMember} in ${msgYear}.`; } if (msgGenre != undefined && msgYear != undefined) { resMessage = `Sorry, ${castMember} starred in no ${msgGenre} movie in ${msgYear}.`; } The meat of the query() method happens next as the code performs the database query using 2 different methods: the classic db.collection.find() method and the db.collection.aggregate() method. The default method used in this Lambda function is the aggregate one, but you can easily test the find() method by setting the * aggregationFramewor*k variable to false . In our specific use case scenario (querying for one single cast member and returning a small amount of documents), there likely won’t be any noticeable performance or programming logic impact. However, if we were to query for all the movies multiple cast members each starred in (i.e. the union of these movies, not the intersection), the aggregation framework query is a clear winner. Indeed, let’s take a closer look at the find() query the code runs: cursor = db.collection(moviesCollection) .find(matchQuery, { _id: 0, Title: 1, Year: 1 }) .collation(collation) .sort({ Year: 1 }); It’s a fairly simple query that retrieves the movie’s title and year, sorted by year. Note that we also use the same { locale: "en", strength: 1 } collation we used to create the case-insensitive index on the Cast property in Part 2 of this blog post series . This is critical since the end user might not title case the cast member’s name (and Lex won’t do it for us either). The simplicity of the query is in contrast to the relative complexity of the app logic we have to write to process the result set we get with the find() method: var maxYear, minYear; for (var i = 0, len = results.length; i < len; i++) { castMemberMovies += `${results[i].Title} (${results[i].Year}), `; } //removing the last comma and space castMemberMovies = castMemberMovies.substring(0, castMemberMovies.length - 2); moviesCount = results.length; var minYear, maxYear; minYear = results[0].Year; maxYear = results[results.length-1].Year; yearSpan = maxYear - minYear; First, we have to iterate over all the results to concatenate its Title and Year properties into a legible string. This might be fine for 20 items, but if we had to process hundreds of thousands or millions of records, the performance impact would be very noticeable. We further have to remove the last period and white space characters of the concatenated string since they’re in excess. We also have to manually retrieve the number of movies, as well as the low and high ends of the movie release years in order to compute the time span it took the cast member to shoot all these movies. This might not be particularly difficult code to write, but it’s clutter code that affects app clarity. And, as I wrote above, it definitely doesn’t scale when processing millions of items. Contrast this app logic with the succinct code we have to write when using the aggregation framework method: for (var i = 0, len = results.length; i < len; i++) { castMemberMovies = results[i].allMovies; moviesCount = results[i].moviesCount; yearSpan = results[i].timeSpan; } The code is not only much cleaner and concise now, it’s also more generic, as it can handle the situation where we want to process movies for each of multiple cast members. You can actually test this use case by uncommenting the following line earlier in the source code : castArray = [castMember, "Angelina Jolie"] and by testing it using this SAM script . With the aggregation framework, we get the correct raw and final results without changing a single line of code: However, the find() method’s post-processing requires some significant effort to fix this incorrect output (the union of comedy movies in which Angelina Jolie or Brad Pitt starred in, all incorrectly attributed to Brad Pitt): We were able to achieve this code conciseness and correctness by moving most of the post-processing logic to the database layer using a MongoDB aggregation pipeline : cursor = db.collection(moviesCollection).aggregate( [ { $match: matchQuery }, { $sort: { Year: 1 } }, unwindStage, castFilterStage, { $group: { _id: "$Cast", allMoviesArray: {$push: {$concat: ["$Title", " (", { $substr: ["$Year", 0, 4] }, ")"] } }, moviesCount: { $sum: 1 }, maxYear: { $last: "$Year" }, minYear: { $first: "$Year" } } }, { $project: { moviesCount: 1, timeSpan: { $subtract: ["$maxYear", "$minYear"] }, allMovies: { $reduce: { input: "$allMoviesArray", initialValue: "", in: { $concat: [ "$$value", { $cond: { if: { $eq: ["$$value", ""] }, then: "", else: ", " } }, "$$this" ] } } } } } ], {collation: collation} ); This aggregation pipeline is arguably more complex than the find() method discussed above, so let’s try to explain it one stage at a time (since an aggregation pipeline consists of stages that transform the documents as they pass through the pipeline): $match stage : performs a filter query to only return the documents we’re interested in (similarly to the find() query above). $sort stage : sorts the results by year ascending. $unwind stage : splits each movie document into multiple documents, one for each cast member in the original document. For each original document, this stage unwinds the Cast array of cast members and creates separate, unique documents with the same values as the original document, except for the Cast property which is now a string value (equal to each cast member) in each unwinded document. This stage is necessary to be able to group by only the cast members we’re interested in (especially if there are more than one). The output of this stage may contain documents with other cast members irrelevant to our query, so we must filter them out in the next stage. $match stage : filters the deconstructed documents from the $unwind stage by only the cast members we’re interested in. This stage essentially removes all the documents tagged with cast members irrelevant to our query. $group stage : groups movies by cast member (for instance, all movies with Brad Pitt and all movies with Angelina Jolie, separately). This stage also concatenates each movie title and release year into the Title (Year) format and adds it to an array called allMoviesArray (one such array for each cast member). This stage also computes a count of all movies for each cast member, as well as the earliest and latest year the cast member starred in a movie (of the requested movie genre, if any). This stage essentially performs most of the post-processing we previously had to do in our app code when using the find() method. Because that post-processing now runs at the database layer, it can take advantage of the database server’s computing power along with the distributed system nature of MongoDB (in case the collection is partitioned across multiple shards, each shard performs this stage independently of the other shards). $project stage : last but not least, this stage performs a $reduce operation (new in MongoDB 3.4) to concatenate our array of ‘ Title (Year) ’ strings into one single string we can use as is in the response message sent back to the bot. Once the matching movies have been retrieved from our MongoDB Atlas database, the code generates the proper response message and sends it back to the bot according to the expected Amazon Lex response format : if (msgGenre != allGenres) { resMessage = `${toTitleCase(castMember)} starred in the following ${moviesCount>1?moviesCount+" ":""} ${msgGenre.toLowerCase()} movie(s)${yearSpan>0?" over " + yearSpan +" years":""}: ${castMemberMovies}`; } else { resMessage = `${toTitleCase(castMember)} starred in the following ${moviesCount>1?moviesCount+" ":""}movie(s)${yearSpan>0?" over " + yearSpan +" years":""}: ${castMemberMovies}`; } if (msgYear != undefined) { resMessage = `In ${msgYear}, ` + resMessage; callback( close(sessionAttributes, "Fulfilled", { contentType: "PlainText", content: resMessage }) ); Our Jennifer Aniston fan can now be wowed by the completeness of our bot's response! Wrap-up and next steps This completes our Lex blog post series and I hope you enjoyed reading it as much as I did writing it. In this final blog post, we tested and deployed a Lambda function to AWS using the SAM Local tool . We also learned: How a Lambda function processes a Lex request and responds to it using Amazon Lex’ input and out event format . How to use a case-insensitive index in a find() or aggregate() query How to make the most of MongoDB’s aggregation framework to move complexity from the app layer to the database layer As next steps, I suggest you now take a look at the AWS documentation to learn how to deploy your bot to Facebook Messenger , Slack or to your own web site . Happy Lex-ing! About the Author - Raphael Londner Raphael Londner is a Principal Developer Advocate at MongoDB, focused on cloud technologies such as Amazon Web Services, Microsoft Azure and Google Cloud Engine. Previously he was a developer advocate at Okta as well as a startup entrepreneur in the identity management space. You can follow him on Twitter at @rlondner

November 26, 2017

Using Amazon Lex, Lambda, & MongoDB Atlas to Build a Voice-Activated Movie Search App - Part 2

November 13, 2017

The User Guide to AWS re:Invent

This post is a mini-guide that walks through some of the things to do while you are at AWS re:Invent this year.

November 13, 2017

Using Amazon Lex, Lambda, & MongoDB Atlas to Build a Voice-Activated Movie Search App - Part 1

November 12, 2017

Building a NodeJS App with MongoDB Atlas and AWS Elastic Container Service - Part 2

In my last post, we started preparing an application built on Node.js and MongoDB Atlas for simple CRUD operations. We've completed the initial configuration of the code and are now ready to launch this into production.

November 9, 2017

Predictions for AWS re:Invent 2017 (tl;dr: AI & IoT)

This post is the second installment of our Road to AWS re:Invent 2017 blog series. In the weeks leading up to AWS re:Invent in Las Vegas this November, we'll be posting about a number of topics related to running MongoDB in the public cloud. See all posts here. In just under two months, more than 46,000 technologists will descend on Las Vegas for this year’s AWS re:Invent . Ranging from seasoned members of the AWS community to the cloud-curious, re:Invent attendees should expect the conference’s sixth iteration to deliver the same parade of ecosystem partners, an extensive agenda focused on moving to (and being successful in) AWS cloud, and the inevitable announcement of a fresh batch of new AWS services. In attempting to predict what this year’s re:Invent keynote will unveil, we’ll look at how the industry has changed since last November, as well as Amazon’s track record for debuting new products at past re:Invents. Since last year’s conference, the two most significant shifts in the space are underpinned by the two largest trends of the moment: AI and IoT. It is safe to assume that we will see an augmentation of AWS’s artificial intelligence and machine learning offerings next month. Last year’s conference brought us Lex , Polly , and Rekognition as Amazon made its entrée into advanced text, voice, and image processing. Widespread adoption of this flavor of artificial intelligence is still modest, so these releases may have been overshadowed by seemingly more relevant tools like Athena, which allows users to run SQL-based queries on data stored in S3. Nonetheless, the development of its AI portfolio is of strategic importance for AWS. Despite being the most popular public cloud, Amazon has faced increasing pressure from Azure and Google Cloud Platform. The latter has been able to differentiate itself among the early-adopter community primarily for its more mature AI offerings. To remain dominant over Google in the space, Amazon must prove able to keep up with the same pace of innovation in this sector. The areas that appear most ripe for innovation from AWS this year are in voice, image, and video analysis. Already, we have seen success among e-commerce players when using text and image-based search to shorten their conversion cycles. In fact, Gartner reports that voice-based search is the fastest growing mobile search type . The opportunity to exploit users’ devices for image and voice-based search is evident in Amazon’s offerings (Alexa, Amazon iOS/Android app). Furthermore, the explosion of intelligent chat-based interfaces (Messenger, Drift, etc.) has increased the demand for a broader set of capabilities in natural language processing services like Lex. As a result, we should be prepared to see further enhancements to Lex, Polly, and Rekognition. Video remains the one area of machine learning-based processing AWS has yet to touch. As their image analysis engines improve, the next logical step would be for the low-latency processing of video inputs. With the untold volume of video content being generated every day by ever-improving cameras, it stands to reason that organizations will want to turn that into insight and profit. These first two predictions hint at another group of potential releases we could see from AWS next month. The development of extensible models for the analysis of text, voice, image, and video is predicated on the accessibility of high quality, low-cost microphones and cameras. While smartphones have supported these inputs for more than a decade now, the availability of WiFi and reliable cellular networks has increased the speed and frequency by which their outputs can be shared or uploaded for further analysis. So, that brings us to our next theme: the Internet of Things. Many analysts and skeptics have suggested IoT adoption is weak and its promises are over-hyped . Their skepticism is primarily centered on two ongoing challenges with IoT: 1) the lack of one or two emergent platforms on which IoT technologies can standardize and 2) the relatively limited ability for data from decentralized sensors to be analyzed at “the edge” rather than in a central cloud. As with operating systems, media encodings, or network protocols, mass adoption of the technologies they support is typically preceded by one to three main players emerging as the default options. AWS entered the competition to build the winning IoT platform at re:Invent 2015 with its announcement of AWS IoT . All other major technology companies have made similar bids for dominance of this market. In addition, there are hundreds of venture-funded startups aiming to serve as a universal platform untethered from an existing “marketecture.” Nevertheless, the fact remains that no winner in this race has yet been crowned. This remains a large opportunity and Amazon is well-poised with its existing portfolio of software and ecosystem of networking and hardware partners. AWS appeared to renew its commitment to capturing the IoT market at last year’s re:Invent with the debut of AWS Greengrass and Lambda@Edge . Greengrass allows for the running of Lambda functions on local, offline devices rather than in Amazon’s cloud. Lambda@Edge is one of AWS’s first forays into “edge computing,” allowing users to run low-latency and device-specific Node.js functions in their “edge locations”. Both releases mark a shift from centralized cloud computing to distributed edge computing—perhaps less comfortable for AWS, but necessary for sometimes-offline or time-sensitive IoT projects. However, Greengrass was just the first step to enabling AWS users to better serve disparate, intermittently-connected devices. Notably, Greengrass still requires ML-powered data processing and analysis to take place in the cloud rather than locally (at the edge). Improvements in hardware technology may also prompt AWS to improve their on-device offerings and make services like S3 and DynamoDB available outside of their infrastructure to better store and process sensor data on the devices themselves. Similarly, we may also see devices become a more significant player in more seasoned services like Kinesis, enabling the local ingestion of data. No matter what gets announced on the keynote stage this year, you can rest assured it will lead the conversation for the months that follow.

November 7, 2017

Building a NodeJS App with MongoDB Atlas and AWS Elastic Container Service - Part 1

October 30, 2017