How to use convolutional neural network on binary image using Keras?

up vote
0
down vote

favorite

I am trying to train a cnn model for ocr using keras. I preprocessed the images by converting to grayscale, removing noise and then converting it to binary, as binary images work better in ocr. But the problem I am getting is that binary image has 2 dimensions and no channel dimension and conv2d in keras(well any conv layer in general) require 3 dimensions. So what should I do to add a dimension but keep image binary? I am using cv2 for image processing so please tell solutions using that preferably. Also tell me whether I am right that using binary image dataset is better for ocr.

asked Nov 9 at 9:51

Shantanu Shinde

change the dnn architecture to only use one channel. Or add redudant channels, but this will make your model unnecessarily complex.
– Micka
Nov 9 at 10:04

@Micka but the conv2d layer of keras requires 3 input dimensions. How can I change that? As for adding redundant channel how to add that?
– Shantanu Shinde
Nov 9 at 10:10

according to the docs: "When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"." So I think you could use input_shape=(height,width,1) for your grayscale or binary data? Sorry, from my side it is only theoretical. And I don't know how to duplicate channels or sth. in python.
– Micka
Nov 9 at 10:24

@Micka I am using binary, not grayscale
– Shantanu Shinde
Nov 9 at 10:54

yes, but it will be used as grayscale. The important thing is, that it is only 1 channel. That's the 1 in input_shape=(height,width,1)
– Micka
Nov 9 at 11:01

|
show 3 more comments

up vote
0
down vote

favorite

asked Nov 9 at 9:51

Shantanu Shinde

change the dnn architecture to only use one channel. Or add redudant channels, but this will make your model unnecessarily complex.
– Micka
Nov 9 at 10:04

@Micka but the conv2d layer of keras requires 3 input dimensions. How can I change that? As for adding redundant channel how to add that?
– Shantanu Shinde
Nov 9 at 10:10

according to the docs: "When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"." So I think you could use input_shape=(height,width,1) for your grayscale or binary data? Sorry, from my side it is only theoretical. And I don't know how to duplicate channels or sth. in python.
– Micka
Nov 9 at 10:24

@Micka I am using binary, not grayscale
– Shantanu Shinde
Nov 9 at 10:54

yes, but it will be used as grayscale. The important thing is, that it is only 1 channel. That's the 1 in input_shape=(height,width,1)
– Micka
Nov 9 at 11:01

|
show 3 more comments

up vote
0
down vote

favorite

asked Nov 9 at 9:51

Shantanu Shinde

python opencv image-processing keras conv-neural-network

asked Nov 9 at 9:51

Shantanu Shinde

asked Nov 9 at 9:51

Shantanu Shinde

asked Nov 9 at 9:51

Shantanu Shinde

asked Nov 9 at 9:51

Shantanu Shinde

asked Nov 9 at 9:51

Shantanu Shinde

change the dnn architecture to only use one channel. Or add redudant channels, but this will make your model unnecessarily complex.
– Micka
Nov 9 at 10:04

@Micka but the conv2d layer of keras requires 3 input dimensions. How can I change that? As for adding redundant channel how to add that?
– Shantanu Shinde
Nov 9 at 10:10

according to the docs: "When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"." So I think you could use input_shape=(height,width,1) for your grayscale or binary data? Sorry, from my side it is only theoretical. And I don't know how to duplicate channels or sth. in python.
– Micka
Nov 9 at 10:24

@Micka I am using binary, not grayscale
– Shantanu Shinde
Nov 9 at 10:54

yes, but it will be used as grayscale. The important thing is, that it is only 1 channel. That's the 1 in input_shape=(height,width,1)
– Micka
Nov 9 at 11:01

|
show 3 more comments

change the dnn architecture to only use one channel. Or add redudant channels, but this will make your model unnecessarily complex.
– Micka
Nov 9 at 10:04

@Micka but the conv2d layer of keras requires 3 input dimensions. How can I change that? As for adding redundant channel how to add that?
– Shantanu Shinde
Nov 9 at 10:10

according to the docs: "When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"." So I think you could use input_shape=(height,width,1) for your grayscale or binary data? Sorry, from my side it is only theoretical. And I don't know how to duplicate channels or sth. in python.
– Micka
Nov 9 at 10:24

@Micka I am using binary, not grayscale
– Shantanu Shinde
Nov 9 at 10:54

yes, but it will be used as grayscale. The important thing is, that it is only 1 channel. That's the 1 in input_shape=(height,width,1)
– Micka
Nov 9 at 11:01

change the dnn architecture to only use one channel. Or add redudant channels, but this will make your model unnecessarily complex.
– Micka
Nov 9 at 10:04

@Micka but the conv2d layer of keras requires 3 input dimensions. How can I change that? As for adding redundant channel how to add that?
– Shantanu Shinde
Nov 9 at 10:10

according to the docs:

"When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"."

So I think you could use input_shape=(height,width,1) for your grayscale or binary data? Sorry, from my side it is only theoretical. And I don't know how to duplicate channels or sth. in python.
– Micka
Nov 9 at 10:24

according to the docs:

"When using this layer as the first layer in a model, provide the keyword argument input_shape (tuple of integers, does not include the batch axis), e.g. input_shape=(128, 128, 3) for 128x128 RGB pictures in data_format="channels_last"."

@Micka I am using binary, not grayscale
– Shantanu Shinde
Nov 9 at 10:54

yes, but it will be used as grayscale. The important thing is, that it is only 1 channel. That's the 1 in input_shape=(height,width,1)
– Micka
Nov 9 at 11:01

|
show 3 more comments

1 Answer
1

active

oldest

votes

up vote
0
down vote

accepted

I got my solution. I used numpy function numpy.expand_dims() to add empty dimension. so it became (width,height,1). Here is what I did:-

img = np.expand_dims(img,axis=2)

answered Nov 9 at 14:45

Shantanu Shinde

add a comment |

Your Answer

StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);

);

draft saved

draft discarded

StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53223398%2fhow-to-use-convolutional-neural-network-on-binary-image-using-keras%23new-answer', 'question_page');

);

Post as a guest

Name

Required, but never shown

1 Answer
1

active

oldest

votes

1 Answer
1

active

oldest

votes

up vote
0
down vote

accepted

I got my solution. I used numpy function numpy.expand_dims() to add empty dimension. so it became (width,height,1). Here is what I did:-

img = np.expand_dims(img,axis=2)

answered Nov 9 at 14:45

Shantanu Shinde

add a comment |

up vote
0
down vote

accepted

I got my solution. I used numpy function numpy.expand_dims() to add empty dimension. so it became (width,height,1). Here is what I did:-

img = np.expand_dims(img,axis=2)

answered Nov 9 at 14:45

Shantanu Shinde

add a comment |

up vote
0
down vote

accepted

I got my solution. I used numpy function numpy.expand_dims() to add empty dimension. so it became (width,height,1). Here is what I did:-

img = np.expand_dims(img,axis=2)

answered Nov 9 at 14:45

Shantanu Shinde

I got my solution. I used numpy function numpy.expand_dims() to add empty dimension. so it became (width,height,1). Here is what I did:-

img = np.expand_dims(img,axis=2)

answered Nov 9 at 14:45

Shantanu Shinde

answered Nov 9 at 14:45

Shantanu Shinde

answered Nov 9 at 14:45

Shantanu Shinde

answered Nov 9 at 14:45

Shantanu Shinde

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Stack Overflow!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

Some of your past answers have not been well-received, and you're in danger of being blocked from answering.

Please pay close attention to the following guidance:

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Dfyjkt