How to use convolutional neural network on binary image using Keras?









up vote
0
down vote

favorite
1












I am trying to train a cnn model for ocr using keras. I preprocessed the images by converting to grayscale, removing noise and then converting it to binary, as binary images work better in ocr. But the problem I am getting is that binary image has 2 dimensions and no channel dimension and conv2d in keras(well any conv layer in general) require 3 dimensions. So what should I do to add a dimension but keep image binary? I am using cv2 for image processing so please tell solutions using that preferably. Also tell me whether I am right that using binary image dataset is better for ocr.










share|improve this question





















  • change the dnn architecture to only use one channel. Or add redudant channels, but this will make your model unnecessarily complex.
    – Micka
    Nov 9 at 10:04










  • @Micka but the conv2d layer of keras requires 3 input dimensions. How can I change that? As for adding redundant channel how to add that?
    – Shantanu Shinde
    Nov 9 at 10:10










  • according to the docs: "When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"." So I think you could use input_shape=(height,width,1) for your grayscale or binary data? Sorry, from my side it is only theoretical. And I don't know how to duplicate channels or sth. in python.
    – Micka
    Nov 9 at 10:24











  • @Micka I am using binary, not grayscale
    – Shantanu Shinde
    Nov 9 at 10:54










  • yes, but it will be used as grayscale. The important thing is, that it is only 1 channel. That's the 1 in input_shape=(height,width,1)
    – Micka
    Nov 9 at 11:01














up vote
0
down vote

favorite
1












I am trying to train a cnn model for ocr using keras. I preprocessed the images by converting to grayscale, removing noise and then converting it to binary, as binary images work better in ocr. But the problem I am getting is that binary image has 2 dimensions and no channel dimension and conv2d in keras(well any conv layer in general) require 3 dimensions. So what should I do to add a dimension but keep image binary? I am using cv2 for image processing so please tell solutions using that preferably. Also tell me whether I am right that using binary image dataset is better for ocr.










share|improve this question





















  • change the dnn architecture to only use one channel. Or add redudant channels, but this will make your model unnecessarily complex.
    – Micka
    Nov 9 at 10:04










  • @Micka but the conv2d layer of keras requires 3 input dimensions. How can I change that? As for adding redundant channel how to add that?
    – Shantanu Shinde
    Nov 9 at 10:10










  • according to the docs: "When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"." So I think you could use input_shape=(height,width,1) for your grayscale or binary data? Sorry, from my side it is only theoretical. And I don't know how to duplicate channels or sth. in python.
    – Micka
    Nov 9 at 10:24











  • @Micka I am using binary, not grayscale
    – Shantanu Shinde
    Nov 9 at 10:54










  • yes, but it will be used as grayscale. The important thing is, that it is only 1 channel. That's the 1 in input_shape=(height,width,1)
    – Micka
    Nov 9 at 11:01












up vote
0
down vote

favorite
1









up vote
0
down vote

favorite
1






1





I am trying to train a cnn model for ocr using keras. I preprocessed the images by converting to grayscale, removing noise and then converting it to binary, as binary images work better in ocr. But the problem I am getting is that binary image has 2 dimensions and no channel dimension and conv2d in keras(well any conv layer in general) require 3 dimensions. So what should I do to add a dimension but keep image binary? I am using cv2 for image processing so please tell solutions using that preferably. Also tell me whether I am right that using binary image dataset is better for ocr.










share|improve this question













I am trying to train a cnn model for ocr using keras. I preprocessed the images by converting to grayscale, removing noise and then converting it to binary, as binary images work better in ocr. But the problem I am getting is that binary image has 2 dimensions and no channel dimension and conv2d in keras(well any conv layer in general) require 3 dimensions. So what should I do to add a dimension but keep image binary? I am using cv2 for image processing so please tell solutions using that preferably. Also tell me whether I am right that using binary image dataset is better for ocr.







python opencv image-processing keras conv-neural-network






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Nov 9 at 9:51









Shantanu Shinde

57




57











  • change the dnn architecture to only use one channel. Or add redudant channels, but this will make your model unnecessarily complex.
    – Micka
    Nov 9 at 10:04










  • @Micka but the conv2d layer of keras requires 3 input dimensions. How can I change that? As for adding redundant channel how to add that?
    – Shantanu Shinde
    Nov 9 at 10:10










  • according to the docs: "When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"." So I think you could use input_shape=(height,width,1) for your grayscale or binary data? Sorry, from my side it is only theoretical. And I don't know how to duplicate channels or sth. in python.
    – Micka
    Nov 9 at 10:24











  • @Micka I am using binary, not grayscale
    – Shantanu Shinde
    Nov 9 at 10:54










  • yes, but it will be used as grayscale. The important thing is, that it is only 1 channel. That's the 1 in input_shape=(height,width,1)
    – Micka
    Nov 9 at 11:01
















  • change the dnn architecture to only use one channel. Or add redudant channels, but this will make your model unnecessarily complex.
    – Micka
    Nov 9 at 10:04










  • @Micka but the conv2d layer of keras requires 3 input dimensions. How can I change that? As for adding redundant channel how to add that?
    – Shantanu Shinde
    Nov 9 at 10:10










  • according to the docs: "When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"." So I think you could use input_shape=(height,width,1) for your grayscale or binary data? Sorry, from my side it is only theoretical. And I don't know how to duplicate channels or sth. in python.
    – Micka
    Nov 9 at 10:24











  • @Micka I am using binary, not grayscale
    – Shantanu Shinde
    Nov 9 at 10:54










  • yes, but it will be used as grayscale. The important thing is, that it is only 1 channel. That's the 1 in input_shape=(height,width,1)
    – Micka
    Nov 9 at 11:01















change the dnn architecture to only use one channel. Or add redudant channels, but this will make your model unnecessarily complex.
– Micka
Nov 9 at 10:04




change the dnn architecture to only use one channel. Or add redudant channels, but this will make your model unnecessarily complex.
– Micka
Nov 9 at 10:04












@Micka but the conv2d layer of keras requires 3 input dimensions. How can I change that? As for adding redundant channel how to add that?
– Shantanu Shinde
Nov 9 at 10:10




@Micka but the conv2d layer of keras requires 3 input dimensions. How can I change that? As for adding redundant channel how to add that?
– Shantanu Shinde
Nov 9 at 10:10












according to the docs: "When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"." So I think you could use input_shape=(height,width,1) for your grayscale or binary data? Sorry, from my side it is only theoretical. And I don't know how to duplicate channels or sth. in python.
– Micka
Nov 9 at 10:24





according to the docs: "When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"." So I think you could use input_shape=(height,width,1) for your grayscale or binary data? Sorry, from my side it is only theoretical. And I don't know how to duplicate channels or sth. in python.
– Micka
Nov 9 at 10:24













@Micka I am using binary, not grayscale
– Shantanu Shinde
Nov 9 at 10:54




@Micka I am using binary, not grayscale
– Shantanu Shinde
Nov 9 at 10:54












yes, but it will be used as grayscale. The important thing is, that it is only 1 channel. That's the 1 in input_shape=(height,width,1)
– Micka
Nov 9 at 11:01




yes, but it will be used as grayscale. The important thing is, that it is only 1 channel. That's the 1 in input_shape=(height,width,1)
– Micka
Nov 9 at 11:01












1 Answer
1






active

oldest

votes

















up vote
0
down vote



accepted










I got my solution. I used numpy function numpy.expand_dims() to add empty dimension. so it became (width,height,1). Here is what I did:-



img = np.expand_dims(img,axis=2)





share|improve this answer




















    Your Answer






    StackExchange.ifUsing("editor", function ()
    StackExchange.using("externalEditor", function ()
    StackExchange.using("snippets", function ()
    StackExchange.snippets.init();
    );
    );
    , "code-snippets");

    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "1"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader:
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    ,
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );













    draft saved

    draft discarded


















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53223398%2fhow-to-use-convolutional-neural-network-on-binary-image-using-keras%23new-answer', 'question_page');

    );

    Post as a guest















    Required, but never shown

























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes








    up vote
    0
    down vote



    accepted










    I got my solution. I used numpy function numpy.expand_dims() to add empty dimension. so it became (width,height,1). Here is what I did:-



    img = np.expand_dims(img,axis=2)





    share|improve this answer
























      up vote
      0
      down vote



      accepted










      I got my solution. I used numpy function numpy.expand_dims() to add empty dimension. so it became (width,height,1). Here is what I did:-



      img = np.expand_dims(img,axis=2)





      share|improve this answer






















        up vote
        0
        down vote



        accepted







        up vote
        0
        down vote



        accepted






        I got my solution. I used numpy function numpy.expand_dims() to add empty dimension. so it became (width,height,1). Here is what I did:-



        img = np.expand_dims(img,axis=2)





        share|improve this answer












        I got my solution. I used numpy function numpy.expand_dims() to add empty dimension. so it became (width,height,1). Here is what I did:-



        img = np.expand_dims(img,axis=2)






        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Nov 9 at 14:45









        Shantanu Shinde

        57




        57



























            draft saved

            draft discarded
















































            Thanks for contributing an answer to Stack Overflow!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid


            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.

            To learn more, see our tips on writing great answers.





            Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


            Please pay close attention to the following guidance:


            • Please be sure to answer the question. Provide details and share your research!

            But avoid


            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.

            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53223398%2fhow-to-use-convolutional-neural-network-on-binary-image-using-keras%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            𛂒𛀶,𛀽𛀑𛂀𛃧𛂓𛀙𛃆𛃑𛃷𛂟𛁡𛀢𛀟𛁤𛂽𛁕𛁪𛂟𛂯,𛁞𛂧𛀴𛁄𛁠𛁼𛂿𛀤 𛂘,𛁺𛂾𛃭𛃭𛃵𛀺,𛂣𛃍𛂖𛃶 𛀸𛃀𛂖𛁶𛁏𛁚 𛂢𛂞 𛁰𛂆𛀔,𛁸𛀽𛁓𛃋𛂇𛃧𛀧𛃣𛂐𛃇,𛂂𛃻𛃲𛁬𛃞𛀧𛃃𛀅 𛂭𛁠𛁡𛃇𛀷𛃓𛁥,𛁙𛁘𛁞𛃸𛁸𛃣𛁜,𛂛,𛃿,𛁯𛂘𛂌𛃛𛁱𛃌𛂈𛂇 𛁊𛃲,𛀕𛃴𛀜 𛀶𛂆𛀶𛃟𛂉𛀣,𛂐𛁞𛁾 𛁷𛂑𛁳𛂯𛀬𛃅,𛃶𛁼

            Edmonton

            Crossroads (UK TV series)