Amazon AWS promises to let analysts do drag-and-drop data cleansing with DataBrew


Amazon today announced it has extended its program for data cleansing, known as Glue, with a visual user interface that automates some steps necessary to prepare data, to simplify the task for non-coders.

Called DataBrew, the program lets data analysts and data scientists carry out the steps known as extract, transform, and load, or ETL, which happen before any data can be analyzed in a data warehouse or another repository. 

Whereas Glue, which was introduced in 2016, was a visual tool for engineers to do ETL with some coding involved, DataBrew is meant for analysts and data scientists to work on the same data cleansing operation by simply clicking buttons and checking off radial boxes in a visual user interface. 

As AWS describes the service, as consisting of “250 pre-built transformations to automate data preparation tasks (e.g. filtering anomalies, standardizing formats, and correcting invalid values) that would otherwise require days or weeks writing hand-coded transformations.”

In a demonstration video, AWS shows how the DataBrew program can, for example, remove special characters in a database entry such as an ampersand, which can’t be used in data analysis. 

Similarly, a text-string can be mapped to numeric values to make the entries analyzable, using a “categorical mapping function.”

So, for example, a column “user type” that includes entries of either “subscriber” or “customer” can be mapped to the values “1” and “2” by clicking the mapping button in the user interface, and clicking a radial button, which produces a new column with the 1 and 2 values corresponding to all the character entries. 

A profiling function offers statistics about the data set, such as the number of missing entries in a dataset. 

The Amazon initiative will presumable provide newfound competition for companies that specialize in data cleansing, such as Talend. 

Amazon said it already had some customers using the software, including Japanese telecom giant NTT DoCoMo and energy giant bp plc.

For further information, there is also a Glue DataBrew blog entry on the product.

Recent Articles

5 Employee Recruitment Strategies for Scouting the Top Candidates – ShoeMoney

The employee recruiting market industry’s worth $19 billion. Why? Employee recruitment’s far beyond hiring. Strong hires come from recruiting, quality interviews, onboarding, and training. Since recruiting’s...

🔥🔥Black Friday🔥⚡️★ QIS.HOST 1GBPS-10GBPS-100GBPS dedicated server.Buy 1 Month Get 1 Month FREE!

QIS.HOST Europe amazing Specials offer !Our Datacenter provide 7 new location with best value 10gbps servers. You can test our network using this...

5 ways to open device manager on Windows 10

Windows 10 comes with a device management tool called Device Manager which can be used for updating and troubleshooting the system devices and...

Related Stories

Leave A Reply

Please enter your comment!
Please enter your name here

Stay on op - Ge the daily news in your inbox

[tdn_block_newsletter_subscribe input_placeholder=”Email address” btn_text=”Subscribe” tds_newsletter2-image=”730″ tds_newsletter2-image_bg_color=”#c3ecff” tds_newsletter3-input_bar_display=”” tds_newsletter4-image=”731″ tds_newsletter4-image_bg_color=”#fffbcf” tds_newsletter4-btn_bg_color=”#f3b700″ tds_newsletter4-check_accent=”#f3b700″ tds_newsletter5-tdicon=”tdc-font-fa tdc-font-fa-envelope-o” tds_newsletter5-btn_bg_color=”#000000″ tds_newsletter5-btn_bg_color_hover=”#4db2ec” tds_newsletter5-check_accent=”#000000″ tds_newsletter6-input_bar_display=”row” tds_newsletter6-btn_bg_color=”#da1414″ tds_newsletter6-check_accent=”#da1414″ tds_newsletter7-image=”732″ tds_newsletter7-btn_bg_color=”#1c69ad” tds_newsletter7-check_accent=”#1c69ad” tds_newsletter7-f_title_font_size=”20″ tds_newsletter7-f_title_font_line_height=”28px” tds_newsletter8-input_bar_display=”row” tds_newsletter8-btn_bg_color=”#00649e” tds_newsletter8-btn_bg_color_hover=”#21709e” tds_newsletter8-check_accent=”#00649e” embedded_form_code=”YWN0aW9uJTNEJTIybGlzdC1tYW5hZ2UuY29tJTJGc3Vic2NyaWJlJTIy” tds_newsletter=”tds_newsletter1″ tds_newsletter3-all_border_width=”2″ tds_newsletter3-all_border_color=”#e6e6e6″ tdc_css=”eyJhbGwiOnsibWFyZ2luLWJvdHRvbSI6IjAiLCJib3JkZXItY29sb3IiOiIjZTZlNmU2IiwiZGlzcGxheSI6IiJ9fQ==” tds_newsletter1-btn_bg_color=”#0d42a2″ tds_newsletter1-f_btn_font_family=”406″ tds_newsletter1-f_btn_font_transform=”uppercase” tds_newsletter1-f_btn_font_weight=”800″ tds_newsletter1-f_btn_font_spacing=”1″ tds_newsletter1-f_input_font_line_height=”eyJhbGwiOiIzIiwicG9ydHJhaXQiOiIyLjYiLCJsYW5kc2NhcGUiOiIyLjgifQ==” tds_newsletter1-f_input_font_family=”406″ tds_newsletter1-f_input_font_size=”eyJhbGwiOiIxMyIsImxhbmRzY2FwZSI6IjEyIiwicG9ydHJhaXQiOiIxMSIsInBob25lIjoiMTMifQ==” tds_newsletter1-input_bg_color=”#fcfcfc” tds_newsletter1-input_border_size=”0″ tds_newsletter1-f_btn_font_size=”eyJsYW5kc2NhcGUiOiIxMiIsInBvcnRyYWl0IjoiMTEiLCJhbGwiOiIxMyJ9″ content_align_horizontal=”content-horiz-center”]